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Preface 



The core of this book, Chapters 3 through 5, presents a course on metric, 
normed, and Hilbert spaces at the senior/graduate level. The motivation for 
each of these chapters is the generalisation of a particular attribute of the 
Euclidean space R": in Chapter 3, that attribute is distance] in Chapter 4, 
length] and in Chapter 5, inner product. In addition to the standard topics 
that, arguably, should form part of the armoury of any graduate student 
in mathematics, physics, mathematical economics, theoretical statistics,..., 
this part of the book contains many results and exercises that are seldom 
found in texts on analysis at this level. Examples of the latter are Wong’s 
Theorem (3.3.12) showing that the Lebesgue covering property is equivalent 
to the uniform continuity property, and Motzkin’s result (5.2.2) that a 
nonempty closed subset of Euclidean space has the unique closest point 
property if and only if it is convex. 

The sad reality today is that, perceiving them as one of the harder parts 
of their mathematical studies, students contrive to avoid analysis courses at 
almost any cost, in particular that of their own educational and technical 
deprivation. Many universities have at times capitulated to the negative 
demand of students for analysis courses and have seriously watered down 
their expectations of students in that area. As a result, mathematics ma- 
jors are graduating, sometimes with high honours, with little exposure to 
anything but a rudimentary course or two on real and complex analysis, 
often without even an introduction to the Lebesgue integral. 

For that reason, and also in order to provide a reference for material 
that is used in later chapters, I chose to begin this book with a long 
chapter providing a fast-paced course of real analysis, covering conver- 
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gence of sequences and series, continuity, differentiability, and (Riemann 
and Riemann-Stieltjes) integration. The inclusion of that chapter means 
that the prerequisite for the book is reduced to the usual undergraduate 
sequence of courses on calculus. (One-variable calculus would suffice, in 
theory, but a lack of exposure to more advanced calculus courses would in- 
dicate a lack of the mathematical maturity that is the hidden prerequisite 
for most senior/graduate courses.) 

Chapter 2 is designed to show that the subject of differentiation does 
not end with the material taught in calculus courses, and to introduce the 
Lebesgue integral. Starting with the Vitali Covering Theorem, the chap- 
ter develops a theory of differentiation almost everywhere that underpins a 
beautiful approach to the Lebesgue integral due to F. Riesz [39]. One minor 
disadvantage of Riesz’s approach is that, in order to handle multivariate 
integrals, it requires the theory of set-valued derivatives, a topic sufficiently 
involved and far from my intended route through elementary analysis that 
I chose to omit it altogether. The only place where this might be regarded 
as a serious omission is at the end of the chapter on Hilbert space, where 
I require classical vector integration to investigate the existence of weak 
solutions to the Dirichlet Problem in three-dimensional Euclidean space; 
since that investigation is only outlined, it seemed justifiable to rely only 
on the reader’s presumed acquaintance with elementary vector calculus. 
Certainly, one-dimensional integration is all that is needed for a sound in- 
troduction to the Lp spaces of functional analysis, which appear in Chapter 
4. 

Chapters 1 and 2 form Part I (Real Analysis) of the book; Part II (Ab- 
stract Analysis) comprises the remaining chapters and the appendices. I 
have already summarised the material covered in Chapters 3 through 5. 
Chapter 6, the final one, introduces functional analysis, starting with the 
Hahn-Banach Theorem and the consequent separation theorems. As well 
as the common elementary applications of the Hahn-Banach Theorem, I 
have included some deeper ones in duality theory. The chapter ends with 
the Baire Category Theorem, the Open Mapping Theorem, and their con- 
sequences. Here most of the applications are standard, although one or two 
unusual ones are included as exercises. 

The book has a preliminary section dealing with background material 
needed in the main text, and three appendices. The first appendix de- 
scribes Bishop’s construction of the real number line and the subsequent 
development of its basic algebraic and order properties; the second deals 
briefly with axioms of choice and Zorn’s Lemma; and the third shows how 
some of the material in the chapters — in particular, Minkowski’s Separation 
Theorem — can be used in the theory of Pareto optimality and competitive 
equilibria in mathematical economics. Part of my motivation in writing 
Appendix C was to indicate that “mathematical economics” is a far deeper 
subject than is suggested by the undergraduate texts on calculus and linear 
algebra that are published under that title. 
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I have tried, wherever possible, to present proofs so that they translate 
mutatis mutandis into their counterparts in a more abstract setting, such 
as that of a metric space (for results in Chapter 1) or a topological space 
(for results in Chapter 3). On the other hand, some results first appear 
as exercises in one context before reappearing as theorems in another: one 
example of this is the Uniform Continuity Theorem, which first appears as^ 
Exercise (1.4.8: 8) in the context of a compact interval of R, and which is 
proved later, as Corollary (3.3.13), in the more general setting of a compact 
metric space. I hope that this procedure of double exposure will enable 
students to grasp the material more firmly. 

The text covers just over 300 pages, but the book is, in a sense, much 
larger, since it contains nearly 750 exercises, which can be classified into at 
least the following, not necessarily exclusive, types: 

• applications and extensions of the main propositions and theorems; 

• results that fill in gaps in proofs or that prepare for proofs later in 
the book; 

• pointers towards new branches of the subject; 

• deep and difficult challenges for the very best students. 

The instructor will have a wide choice of exercises to set the students as 
assignments or test questions. Whichever ones are set, as with the learning 
of any branch of mathematics it is essential that the student attempt as 
many exercises as the constraints of time, energy, and ability permit. 

It is important for the instructor / student to realise that many of the 
exercises — especially in Chapters 1 and 2 — deal with results, sometimes 
major ones, that are needed later in the hook. Such an exercise may not 
clearly identify itself when it first appears; if it is not attempted then, it 
will provide revision and reinforcement of that material when the student 
needs to tackle it later. It would have been unreasonable of me to have 
included major results as exercises without some guidelines for the solution 
of the nonroutine ones; in fact, a significant proportion of the exercises of 
all types come with some such guideline, even if only a hint. 

Although Chapters 3 through 6 make numerous references to Chapters 1 
and 2, 1 have tried to make it easy for the reader to tackle the later chapters 
without ploughing through the first two. In this way the book can be used 
as a text for a semester course on metric, normed, and Hilbert spaces. (If 



reference of the form Proposition (a.b.c) is to Proposition c in Section b of 
Chapter a; one to Exercise (a.b.c: d) is to the rfth exercise in the set of exercises 
with reference number (a.b.c); and one to (B3) is to the 3rd result in Appendix 
B. Within each section, displays that require reference indicators are numbered 
in sequence: (1), (2), .... The counter for this numbering is reset at the start of 
a new section. 
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Chapter 2 is not covered, the instructor may need to omit material that 
depends on familiarity with the Lebesgue integral — in particular Section 4 
of Chapter 4.) Chapter 6 could be included to round off an introductory 
course on functional analysis. 

Chapter 1 could be used on its own as a second course on real analysis 
(following the typical advanced calculus course that introduces formal no- 
tions of convergence and continuity); it could also be used as a first course 
for senior students who have not previously encountered rigorous analysis. 
Chapters 1 and 2 together would make a good course on real variables, in 
preparation for either the material in Chapters 3 through 5 or a course on 
measure theory. The whole book could be used for a sequence of courses 
starting with real analysis and culminating in an introduction to functional 
analysis. 

I have drawn on the resource provided by many excellent existing texts 
cited in the bibliography, as well as some original papers (notably [39], in 
which Riesz introduced the development of the Lebesgue integral used in 
Chapter 2). My first drafts were prepared using the Scientific Word 
Processing System; the final version was produced by converting the drafts 
to and then using Scientific Word. Both and Scientific Word are 
products of TCI Software Research, Inc. 

I am grateful to the following people who have helped me in the 
preparation of this book: 

— Patrick Er, who first suggested that I offer a course in analysis for 
economists, which mutated into the regular analysis course from 
which the book eventually emerged; 

— the students in my analysis classes from 1990 to 1996, who suffered 
various slowly improving drafts; 

— Cris Calude, Nick Dudley Ward, Mark Schroder, Alfred Seeger, Doru 
Stefanescu, and Wang Yuchuan, who read and commented on parts 
of the book; 

— the wonderfully patient and cooperative staff at Springer-Verlag; 

— my wife and children, for their patience (in more than one sense). 

It is right and proper for me here to acknowledge my unspoken debt of 
gratitude to my parents. This book really began 35 years ago, when, with 
their somewhat mystified support and encouragement, I was beginning my 
love affair with mathematics and in particular with analysis. It is sad that 
they did not live to see its completion. 



Douglas Bridges 
28 January 1997 
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Introduction 



We may our ends by our beginnings know. 
OF PRUDENCE (Sir John Denham) 



What we now call analysis grew out of the calculus of Newton and Leib- 
niz, was developed throughout the eighteenth century (notably by Eu- 
ler), and slowly became logically sound (rigorous) through the work of 
Gauss, Cauchy, Riemann, Weierstrass, Lebesgue, and many others in the 
nineteenth and early twentieth centuries. 

Roughly, analysis may be characterised as the study of limiting pro- 
cesses within mathematics. These processes traditionally include the con- 
vergence of infinite sequences and series, continuity, differentiation, and 
integration, on the real number line R; but in the last 100 years analysis 
has moved far from the one- or finite-dimensional setting, to the extent 
that it now deals largely with limiting processes in infinite-dimensional 
spaces equipped with structures that produce meaningful abstractions of 
such notions as limit and continuous. Far from being merely the fantasti- 
cal delight of mathematicians, these infinite-dimensional abstractions have 
served both to clarify phenomena whose true nature is often obscured by the 
peculiar structure of R, and to provide foundations for quantum physics, 
equilibrium economics, numerical approximation — indeed, a host of areas of 
pure and applied mathematics. So important is analysis that it is no exag- 
geration to describe as seriously deficient any honours graduate in physics, 
mathematics, or theoretical economics who has not had good exposure to 
at least the fundamentals of metric, normed, and Hilbert space theory, if 
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not the next step, in which metric notions all but disappear in the further 
abstraction of topological spaces. 

Like many students of mathematics, even very good ones, you may find 
it hard to see the point of analysis, in which intuition often seems sacri- 
ficed to the demon of rigour. Is our intuition — algebraic, arithmetic, and 
geometric — not a sufficiently good guide to mathematical reality in most 
cases? Alas, it is not, as is illustrated by considering the differentiability of 
functions. (We are assuming here that you are familiar with the derivative 
from elementary calculus courses.) 

When you first met the derivative, you probably thought that any contin- 
uous (real-valued) function — that is, loosely, one with an unbroken graph — 
on an interval of R has a derivative at all points of its domain; in other 
words, its graph has a tangent everywhere. Once you came across simple 
examples, like the absolute value function x ^ \x\ ^ of functions whose 
graphs are unbroken but have no tangent at some point, it would have 
been natural to conjecture that if the graph were unbroken, then it had a 
tangent at all but a finite number of points. If you were really smart, you 
might even have produced an example of a continuous function, made up 
of lots of spikes, which was not differentiable at any of a sequence of points. 
This is about as far as intuition can go. But, as Weierstrass showed in the 
last century, and as you are invited to demonstrate in Exercise (1.5.1: 2), 
there exist continuous functions on R whose derivative does not exist any- 
where. Even this is not the end of the story: in a technical sense discussed 
in Chapter 6, most continuous functions on R are nowhere differentiable! 
Here, then, is a dramatic failure of our intuition. We could give examples of 
many others, all of which highlight the need for the sort of careful analysis 
that is the subject of this book. 

Of course, analysis is not primarily concerned with pathological exam- 
ples such as Weierstrass’s one of a continuous, nowhere differentiable func- 
tion. Its main aim is to build up a body of concepts, theorems, and proofs 
that describe a large part of the mathematical world (roughly, the contin- 
uous part) and are well suited to the mathematical demands of physicists, 
economists, statisticians, and others. The central chapters of this book, 
Chapters 3 through 5, give you an introduction to some of the fundamental 
concepts and results of modern analysis. The earlier chapters serve either as 
a background reference for the later ones or, if you have not studied much 
real analysis before, as a rapid introduction to that topic, in preparation 
for the rest of the book. The final chapter introduces some of the main 
themes of functional analysis, the study of continuous linear mappings on 
infinite-dimensional spaces. 

Having understood Chapters 3 through 6, you should be in a position to 
appreciate such other jewels of modern analysis as 



• abstract measure spaces, integration, and probability theory; 
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• approximation theory, in which complicated types of functions are 
approximated by more tractable ones such as polynomials of fixed 
maximum degree; 

• spectral theory of linear operators on a Hilbert space, generalising 
the theory of eigenvalues and eigenvectors of matrices; 

• analysis of one and several complex variables; 

• duality theory in topological vector spaces; 

• Haar measure and duality on locally compact groups, and the 
associated abstract generalisation of the Fourier transform; 

• C*- and von Neumann algebras of operators on a Hilbert space, 
providing rigorous foundations for quantum mechanics; 

• the theory of partial differential equations and the related potential 
problems of classical physics; 

• the calculus of variations and optimisation theory. 

These, however, are the subjects of other books. The time has come to 
begin this one by outlining the background material needed in the main 
chapters. 

Throughout this book, we assume familiarity with the fundamentals of 
informal set theory, as found in [20] . We use the following notation for sets 
of numbers. 

The set of natural numbers: N = {0, 1,2,.. .}. 

The set of positive integers: = {1,2,3,...}. 

The set of integers: Z = {0, — 1, 1, — 2, 2, . . .}. 

The set of rational numbers: Q = {±^ :m,nGN, n^O}. 

For the purposes of this preliminary section only, we accept as given the 
algebraic and order properties of the set R of real numbers, even though 
these are not introduced formally until Chapter 1. 

When the rule and domain describing a function f : A ^ B are known 
or clearly understood, we may denote / by 

X f{x). 

Note that we use the arrow — >■ as in “the function / : H — >■ H”, and the 
barred arrow i— as in “the function cc i— on R” . 

We regard two functions with the same rule but different domains as 
different functions. In fact, we define two functions / and g to be equal if 
and only if 
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• they have the same domain and 

• f{x) = g{x) for each x in that domain. 

Thus the function x ^ x"^ with domain N is not the same as the function 
X ^ x"^ with domain R. When considering a rule that defines a function, 
we usually take the domain of the function as the set of all objects x (or 
at least all x of the type we wish to consider) to which the rule can be 
applied. For example, if we are working in the context of R, we consider 
the domain of the function x i— l/{x — 1) to be the set consisting of all 
real numbers other than 1. 

We sometimes give explicit definitions of functions by cases. For example, 

{ 0 if a; is rational 
1 if a; is irrational 
defines a function / : R — >■ {0, 1}. 

A sequence is just a special kind of function: namely, one of the form 
n Xn with domain N^; Xn is then called the nth term of the sequence. 
We denote by {xn)'^=i, or (a;i, a; 2 , ■ • ■)> or even just (a:„), the sequence whose 
nth term is Xn- (Of course, n is a dummy variable here; so, for example, 
(xk) is the same sequence as (x„).) If all the terms of (a;„) belong to a set 
X, we refer to (x„) as a sequence in X. We also apply the word “sequence”, 
and notations such as (a:„)j)Tjy, to a mapping n Xn whose domain has 
the form {n G Z : n > n} for some integer v. 

A subsequence of (cc„) is a sequence of the form 

)fe=l “ (^ni ) Xn2 j )■■■)) 

where n\ < n 2 < < ■ ■ ■ . More generally, if / is a one-one mapping of 

into itself, we write {x or even just (xf^n)), to denote the sequence 

whose nth term is Xf(^n)- This enables us, in Section 2 of Chapter 1, to 
make sense of an expression like ^/(n)> denoting a rearrangement of 

the infinite series 

By a finite sequence we mean an ordered n-tuple (xi, . . . ,x„), where n 
is any positive integer. 

A nonempty set X is said to be countable, or to have countably many 
elements, if it is the range of a sequence. Note that a nonempty finite set 
is countable according to this definition. An infinite countable set is said 
to be countably infinite. We regard the empty set as being both finite and 
countable. A set that is not countable is said to be uncountable, and to 
have uncountably many elements. 

Let /, g be mappings from subsets of a set X into a set Y, where Y 
is equipped with a binary operation O. We introduce the corresponding 
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pointwise operation <> on f and g by setting 

ifOg){x) = f{x)Og{x) 

whenever f{x) and g(x) are both defined. Thus, taking T = R, we see that 
the (pointwise) sum of / and g is given by 

{f + g){x) = f{x)+g{x) 

if f{x) and g(x) are both defined; and that the (pointwise) quotient of / 
and g is given by 

if/g){x) = f{x)/g{x) 

if f{x) and g{x) are defined and g{x) ^ 0. li X = so that / = (x„) 
and g = (j/„) are sequences, then we also speak of termwise operations] for 
example, the termwise product of / and g is the sequence {xnyn)’^=i- 
Pointwise operations extend in the obvious ways to finitely many func- 
tions. In the case of a sequence {fn)'^=i of functions with values in a normed 
space (see Chapter 4), once we have introduced the notion of a series in a 
normed space, we interpret fn the obvious way. 

By a family of elements of a set X we mean a mapping A i— xy of a set 
L, called the index set for the family, into X. We also denote such a family 
by {x\)\^j^- A family with index set N'*' is, of course, a sequence. By a 
subfamily of a family we mean a family j where J C L. 

If (S'y);^g^ is a family of sets, we write 

\JS^={x:3XgL{xGS^)}, 

XeL 

f]S^={x:VX€L{x€S^)}, 

XeL 

and we call Uasl Hasl'^A’ respectively, the union and the 

intersection of the family 

We need some information about order relations on a set. (For fuller 
information about orders in general see Chapter 1 of [9].) 

A binary relation i? on a set X is said to be 

• reflexive if 

Vo G A (aRa)] 



• irreflexive if 



Va G A (not(ai?a)) ; 



• symmetric if 



Va, 6 G A {aRb bRa)] 
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• asymmetric if 

Vo, b € X {aRb not(6i?o)); 



• antisymmetric if 

Vo, b € X {{aRb and bRa) ^ a = b); 



• transitive if 



Vo, b,c € X {{aRa and bRc) oi?c); 



• total if 

Vo, b £ X {aRb or bRa). 

We use ^ to represent a reflexive relation, and to represent an irreflex- 
ive one. The notation a ^ b (respectively, a ^ b) is equivalent to 6 ^ o 
(respectively, 6 o). When dealing with the usual order relations on the 

real line R, we use the standard symbols >,>,<,< instead of 
respectively. 

A binary relation i? on a set X is said to be 

• a preorder if it is reflexive and transitive; 

• an equivalence relation if it is a symmetric preorder (in which case X 
is partitioned into disjoint equivalence classes, each equivalence class 
consisting of elements that are related under R, and the set of these 
equivalence classes, written X/R, is called the quotient set for R); 

• a partial order if it is an antisymmetric preorder; 

• a total order if it is a total partial order; 

• a strict partial order if it is asymmetric and transitive — or, equiva- 
lently, if it is irreflexive and transitive. 

If i? is a partial order on X, we call the pair {X, R) — or, when there is 
no risk of confusion, just the set X itself — a partially ordered set. 

With each preorder ^ on A we associate a strict partial order and an 
equivalence relation ^ on X, defined as follows. 

X y if and only if x y and not{y x); 

X ^ y if and only if x y and y x. 

If ^ is a total order, we have the Law of Trichotomy: 



Vx, y, z £ X {x y or X = y or X < y) . 
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Let S' be a nonempty subset of a partially ordered set {X, ^). An element 
B G X is called an upper hound, or majorant, of S (relative to if B ^ x 
for all X G S. If there exist upper bounds of S, then we say that S is bounded 
above, or majorised. An element B G X is called a least upper bound, or 
supremum, of S if the following two conditions are satisfied. 

— i? is an upper bound of S; 

— if B' is an upper bound of S, then B' B. 

Note that S has at most one supremum: for if B, B' are suprema of S, then 
B' B B' and so B' = B, by the antisymmetry of ^ . If the supremum 
of S exists, we denote it by sup S. We also denote it by 

sup Xi, maxS, max Xj, or xi V X 2 V • • • V x„ 

l<i<n l<i<n 

if S = {xi, . . . , Xn} is a finite set, and by 

00 

SUpXn or W Xn 

n>l T 

— n—1 

if S = {xi, X 2 , . . .} is a countable set; we use similar notations without fur- 
ther comment. An upper bound of S that belongs to S is called a maximum 
element of S, and is then a least upper bound of S. The maximum element, 
if it exists, of S is also called the largest, or greatest, element of S. 

An element b G X is called a lower bound, or minorant, of S (relative to 
if X ^ 6 for all X G S. If there exist lower bounds of S, then we say that 
S is bounded below, or minorised. An element b G X is called a greatest 
lower hound, or infimum, of S if the following two conditions are satisfied. 

— 6 is a lower bound of S; 

— if b' is a lower bound of S, then b)p b' . 

S has at most one infimum, which we denote by inf S. When describing 
infima, we also use such notations as 

inf Xi, min S', min Xj, or Xi A X 2 A • • • A x„ 

l<2<n l<i<n 

if S = {xi, . . . , Xn} is a finite set, and 

00 

inf Xn or A Xn 

n>l ' ' 

n—1 

if S = {xi, X 2 , . . .} is a countable set. A lower bound of S that belongs to 
S is called a minimum element of S, and is a greatest lower bound of S. 
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The minimum element, if it exists, of S is also called the smallest, or least, 
element of S. 

The usual partial order > on R gives rise to important operations on 
functions. If /, g are real-valued functions, we write f > g (or g < f) to 
indicate that f{x) > g(x) for all x common to the domains of / and g. 
Regarding V and A as binary operations on R, we define the corresponding 
functions / V g and f A g as special cases of the notion fOg previously 
introduced. By extension of these ideas, if is a sequence of real- 

valued functions, then the functions V^i /« A^i /« defined by 

( oo \ oo 

V = V Mx), 

n^l / n^l 

( oo \ oo 

A = A 

n=l / n=l 

whenever the right-hand sides of these equations make sense. 

Now let / be a mapping of a set X into the partially ordered set (R, >). 
We say that / is bounded above on X if 

f{X) = {fix) :xGX} 

is bounded above as a subset of Y. We call supf{X), if it exists, the supre- 
mum of f on X, and we denote it by sup/, sup2,g;if fi^), or, in the case 
where X is a finite set, max /. We also use obvious variations on these nota- 
tions, such as sup„>^ /(n) when X = We adopt analogous definitions 
and notations for bounded below on X, infimum of f, inf /, and min /. 

Finally, let / be a mapping of a partially ordered set (X,)^) into the 
partially ordered set (R, >). We say that / is 

— increasing if f{x) > f{x') whenever x x'-, 

— strictly increasing if f{x) > f{x') whenever x >~ x'] 

— decreasing if f{x) < f{x') whenever x x'-, and 

— strictly decreasing if f{x) < f{x') whenever x >- x' . 

Note that we use “increasing” and “strictly increasing” where some authors 
would use “nondecreasing” and “increasing”, respectively. 
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Analysis on the Real Line 



...I will a round unvarnish’d tale deliver... 

OTHELLO, Act 1, Scene 3 

In this chapter we provide a self-contained development of analysis on the real 
number line. We begin with an axiomatic presentation of R, from which we de- 
velop the elementary properties of exponential and logarithmic functions. We 
then discuss the convergence of sequences and series, paying particular atten- 
tion to applications of the completeness of R. Section 3 introduces open and 
closed sets, and lays the groundwork for later abstraction in the context of a 
metric space. Section 4 deals with limits and continuity of real-valued functions; 
the Heine-Borel-Lebesgue and Bolzano-Weierstrass theorems prepare us for the 
general, and extremely useful, notion of compactness, which is discussed in Chap- 
ter 3. The final section deals with the differential and integral calculus, a subject 
that is reviewed from a more advanced standpoint in Chapter 2. 



1.1 The Real Number Line 

Although it is possible to construct the real number line R from N using 
elementary properties of sets and functions, in order to take us quickly 
to the heart of real analysis we relegate such a construction to Appendix 
A and instead present a set of axioms sufficient to characterise R. These 
axioms fall into three categories: the first introduces the algebra of real 
numbers; the remaining two are concerned with the ordering on R. 
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Axiom Rl. R zs o field — that is, there exist 

a binary operation (x,y) x + y oi addition on R, 
a binary operation {x,y) i— xy of multiplication^ on R, 
distinguished elements 0 (zero) and 1 (one) of R, with 0 yf 1, 
a unary operation x i— —x (negation) on R, and 
a unary operation x i— x~^ of reciprocation, or inversion, on R\ {0} 
such that for all x,y, z G R, 

x + y = y + x, 

(x + y)+z = x+{y + z), 

0 + X = X, 

X + (—x) = 0, 
xy = yx, 

(xy) z = x (yz) , 
x(y + z) = xy + xz, 
lx = X, and 
xx~^ = 1 if a: yf 0. 

Of course, we also denote x~^ by ^ or 1/x. 

Axioms R2. R is endowed with a total partial order > (greater than 
or equal to), and hence an associated strict partial order > (greater than), 
such that 

• x>y, then x + z > y + z, and 

• if X > 0 and y > 0, then xy > 0. 

Axiom R3. The least-upper-bound principle: if a nonempty subset S of 
R is bounded above relative to the relation >, then it has a (unique) least 
upper bound. 

The elements of R are called real numbers. We say that a real number 
X is 

• positive if X > 0, 

• negative if — x > 0, and 



^For clarity, we sometimes write x ■ y ot x x y for the product xy. 
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• nonnegative if x > 0. 

We denote the set of positive real numbers by R^, and the set of 
nonnegative real numbers by 

Many of the fundamental arithmetic and order properties of R are imme- 
diate consequences of results in the elementary theories of fields and partial 
orders, respectively. A number of these, illustrating the interplay between 
the algebra and the ordering on R, are given in the next set of exercises.^ 



(1.1.1) Exercises 

Prove each of the following statements, where x,y,Xi,yi (1 < i < n) are 
real numbers. 

.1 If Xi > yi for each i, then ^ Y^7=iVi- If ^Iso Xk > yk for 

some k, then Ya=i > Yh=i Vi- 

.2 X > y ii and only ii x + z > y + z for all z € R ; this remains true 
with each instance of > replaced by one of > . 

.3 If Xi > 0 for each i and then xi = X 2 = • • • = x„ = 0. 

.4 The following are equivalent: x > y, x — y > 0, —y > —x, 0 > y — x-, 
these equivalences also hold with > replaced everywhere by > . 

.5 li X > y and z > 0, then xz > yz. 

.6 If X > 0 and y > 0, then xy > 0; if x > 0 and 0 > y, then 0 > xy, if 

0 > X and 0 > y, then xj/ > 0; and these results hold with > replaced 
everywhere by >. 

.7 x^ > 0, and x^ = 0 if and only if x = 0. 

.8 If X > 0, then x~^ > 0; and if x < 0, then x~^ < 0. 

.9 X > y if and only if xz > yz for all z > 0. 

.10 X > y > 0 if and only if y ^ > x ^ > 0. 

.11 max{x,y} > 0 if and only if x > 0 or y > 0; max{x,y} > 0 if and 
only if X > 0 or y > 0. 

.12 min{x,y} > 0 if and only if x > 0 and y > 0; min{x,y} > 0 if and 
only if X > 0 and y > 0. 



you are comfortable with the elementary field and order properties of R, 
then you can safely omit Exercises (1.1.1) and (1.1.2). 
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.13 The mapping 

if n = 0 
if n > 1 



from N into R is one-one and preserves order, addition, and 
multiplication. 

We use this mapping to identify N with the subset {nl : n G N} of 
R. In turn, we then identify a negative integer n with — (— n) 1, and 
a rational number m/n with the real number mn~^. We make these 
identifications without further comment. 

.14 If S' is a nonempty majorised set of integers, then m = supS is an 
integer. (Assume the contrary and obtain integers n,n' such that 
m — l<n<n'< m.) 

.15 There exists n G Z such that n — 1 < a; < n. (If a; > 0, apply the 
least-upper-bound principle toS = {fcGZ:/c<x}.) 

.16 If a; > 0 and y > 0, then there exists n G such that nx > y. 
(Consider {fc G N kx <y} .) 

This important property is sometimes introduced as an axiom, the 
Axiom of Archimedes. 

.17 a; > 0 if and only if there exists a positive integer n > x 

.18 X > 0 if and only if x > — 1/n for all positive integers n. 

.19 Q is order dense in R — that is, if x < y, then there exists y G Q such 
that X < q < y. (Reduce to the case y > 0. Choose in turn integers 
n > l/(y — x) and k > ny, and let m be the least integer such that 
y < m/n. Show that x < (m — l)/n < y.) 

.20 If S and T are nonempty majorised sets of positive numbers, then 

sup{st : s G S, t gT} = sups' x supT. 

.21 The following are equivalent conditions on nonempty subsets X and 
Y of R. 

(i) X < y for all x G AT and y GY. 

(ii) There exists r G R such that x < r < y for all x G X and y GY. 



n i-G nl = 






n terms 
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Each real number x has a corresponding absolute value, defined as 

|a;| = max {x, —a;} . 



(1.1.2) Exercises 

Prove each of the following statements about real numbers x,y,e. 

.1 |x| > 0, and |x| = 0 if and only if a: = 0. 

.2 |x| < £ if and only if — £ < x < e. 

.3 \x\ < £ if and only if — £ < a; < £. 

.4 X = 0 if and only if either |x| < e for each £ > 0 or else \x\ < e for 
each £ > 0. 

.5 |a; + j/| < |x| + \y\ {triangle inequality). 

.6 |x-j/| > ||x| - |y||. 

.7 \xy\ = \x\\y\. 

So far we have not indicated how useful the least-upper-bound principle 
is. In fact, it is not only useful, but essential: the field Q of rational numbers, 
with its usual ordering >, satisfies all the properties listed in axioms R1 
and R2, so we need something more to distinguish R from Q. Moreover, 
without the least-upper-bound principle or some property equivalent to it, 
we cannot even prove that a positive real number has a square root. 

We now sketch how the least-upper-bound principle enables us to define 
for any a > 0 and any r S R. When n is an integer, a" is defined as in 
elementary algebra. So our first real task is to define a"*/” when m and n 
are nonzero integers; this we do by setting 

a™/” = sup {x G R : x" < a™} . (1) 

Of course, we are using the least-upper-bound principle here, so we must 
ensure that the set on the right-hand side of (1) is both nonempty and 
bounded above. To prove that it is nonempty, we use the Axiom of 
Archime- des (Exercise (1.1.1: 16)) to find a positive integer k such that 
ka™ > 1; then k^a”^ > kd^ > 1, so (1/A:)” < a™. On the other hand, as 

(1 + a™)” > 1 + na*” > 0 ™ if n > 1, and 
(i+b") — 1 ~ >0™ if n < —1, 

the set in question is bounded above (by 1 + a™ in the first case, and by 
1/ (1 + o’”) in the second). Hence a™/” exists. 

Our first result enables us to prove some basic properties of o’”/". 
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(1.1.3) Lemma. Let a > 0 and s he real numbers, and m,n positive 
integers such that s” < a™. Then there exists t G H such that s < t and 
r < a™. 

Proof. Using Exercise (1.1.1: 16), choose a positive integer N such that 
0<N~^ < min {l, 2-” (1 + |s|)”” (a™ - s")} . 

Writing t = s + N~'^ and using the binomial theorem, we have 




(i+i,irjv-'|:("^ 

2”(1+ |s|)"lV-i 



< s” + 



< s'"- + 



< s"‘ + 



= s 
< a" 



as we required. □ 

Taking s = 0 in this lemma, we see that a™/” > 0. The lemma also 
enables us to prove that 

= a™. (2) 

For if (a™/”)" < a™, then, by Lemma (1.1.3), there exists t > a™/" such 
that t" < o’", which contradicts the definition of a™/"; on the other hand, 
that same definition ensures that (a"*/") < a™ and hence that (2) holds. 

Using (2) and methods familiar from elementary algebra courses, we can 
now prove the usual laws of indices, 

{aj-y = ary 

when the indices r, s are rational. 

We next extend the definition of to cover all r G R. To begin with, 
we consider the case a > 1, when we define 



of = sup {a'^ : q G Q, q < r} . 



(3) 
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It is left as an exercise to show that the set on the right-hand side of (3) 
is nonempty and bounded above, and that if r is rational, this definition 
gives a’’ the same value as the one given by our earlier definition. We can 
now prove the laws of indices for arbitrary r, s G R. Taking the first law 
as an illustration, we observe that if u, v are rational numbers with u < r 
and V < s, then u + v < r + s, so 

< a’'+®. 

By Exercise (1.1.1: 20), 

a’’a® = sup {a“ : u £ Q, u < r} x sup {a'" : v £ Q, v < s} 

= sup {d^d" : u,v £ Q, u < r, u < s} 

< 

On the other hand, if g G Q and q < r + s, then we choose rational numbers 
u, V with u < r, V < s, and g = u + u : to do so, we use Exercise (1.1.1: 19) 
to find u £ Q with q — s < u < r and we then set v = q — u. We have 

a<} = a^+^ =a^a^ <a^ad 

Hence 

= sup {a'^ ■ q £ Q, q < r + s} < a’’a®, 

and therefore o’’ a® = 

It remains to define 

( (a“^) ^ if 0 < a < 1 

a^ = 

[ 1 if a = 1 

and to verify — routinely — that the laws of indices hold in these cases also. 
(1.1.4) Exercises 

.1 Let a > 1 and let r G R. Prove that {a^ : g G Q, g < r} is nonempty 
and bounded above. Prove also that if r = m/n for integers m, n with 
n 0, then definitions (1) and (3) give the same value for ad . 

.2 Prove that if 0 < a 1 and = 1, then x = 0. (Consider first the 

case where a > 1, and note that if g G Q and o'? < 1, then g < 0.) 

.3 Let a > 0 and x > y. Prove that if a > 1, then > a^; and that if 
a < 1, then . 

.4 Prove that if a > 0, then for each a; > 0 there exists a unique y G R 

such that = x. (First take a > 1 and x > 1. Write a = 1 + t and, 

by expanding (1 + t)”, compute n G such that a” > x. Then 
consider {g G Q : a'^ < x}.) 
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.5 Let / be a strictly increasing mapping of R onto such that /(O) = 
1 and f{x+y) = f{x)f{y). Prove that f{x) = a^, where a = /(I) > 1. 
(First prove that f(q) = for all rational q.) 

If a > 0, Exercise (1.1.4: 4) allows us to define log^j, the logarithmic 
function with base a, as follows. For each x > 0, 

y = log„ X if and only if = x. 

This function has domain R^ and maps R^ onto R. From the laws of 
indices we easily deduce the laws of logarithms : 

loga xy = log„ X + log„ y, 
loga(a;’') = rlog^x, 

logf, X = log^ a X log^ X, where 6 > 0. 

Anticipating the theory of convergence of series from the next section, 
we introduce the number 



and call loge the natural logarithmic function on R^. It is customary to 
denote loge by either log or In . 

(1.1.5) Exercises 

.1 Prove the laws of logarithms. 

.2 Prove that if a > 1, then the function log^, is strictly increasing; and 
that if 0 < a < 1, then log^j is strictly decreasing. 

.3 Let a > 1, and let / be an increasing mapping of R^ into R such 
that /(a) = 1 and f{xy) = /(x) + f{y). Prove that /(x) = log^, x. 

For convenience, we collect here the definitions of the various types of 
interval in R. 

The open intervals are the sets of the following forms, where a, b are real 
numbers with a < b : 



(o, b) = {x G 
(a, oo) = {x G 
(— oo, 6) = {x G 
(— oo, oo) = R. 



R : a < X < 6} , 
R : a < x} , 

R : X < 6} , 
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The closed intervals are the sets of the following forms, where a, b are real 
numbers with a < b : 



[a, b] = {x € R : a < X < b} , 

[a, oo) = {a; G R : a < x} , 

(— 00 , 6] = {a; G R : a; < 5} . 

By convention, R is regarded as both an open interval and a closed interval. 
The remaining types of interval are: 

half open on the left: {a,b] = {a; G R : a < a; < 6} , 

half open on the right: [a, 6) = {a; G R : a < a; < 6} . 

Intervals of the form [a, b], (a, b), [a, b), or (a, b], where a, & G R, are said 
to be finite or bounded, and to have left endpoint a, right endpoint b, and 
length b — a. Intervals of the remaining types are called infinite and are 
said to have length 00 . The length of any interval I is denoted by |/| . A 
bounded closed interval in R is also called a compact interval. 

Finally, we define the complex numbers to be the elements of the set 
C = R X R, with the usual equality and with algebraic operations of 
addition and multiplication defined, respectively, by the equations 

(x,y) + {x,y) = {x + x,y + y), 

(x,y) X {x' ,y') = {xx' - yy ' , xy' + x'y) . 

Then x i-G (x,0) is a one-one mapping of R onto the set C x {0} and is 
used to identify R with that subset of C. With this identification, we have 
i^ = —1, where i is the complex number (0, 1); so the complex number 
{x, y) can be identified with the expression x + iy. The real numbers x and 
y are then called the real and imaginary parts of z = (x,y), respectively, 
and we write 



X = Re{x,y), 
y = Im(a;,7/). 



The conjugate of z is 

z* = {x,-y) =x-iy, 



and the modulus of z is 

|z| = \/x2 + 

In the remainder of this book we assume the basic properties of the real 
and complex numbers such as those found in the foregoing exercises. 
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1.2 Sequences and Series 

Although often relegated to a minor role in courses on real analysis, the 
theory of convergence of sequences and series in R provides both a model 
for more abstract convergence theories such as those in our later chapters, 
and many important examples. 

It is convenient to introduce here two useful expressions about properties 
of positive integers. Let P(m, n) be a property applicable to pairs (m, n) of 
positive integers. If there exists N such that P{m, n) holds for all m,n> N, 
then we say that P{m, n) holds for all sufficiently large m and n. We 
interpret similarly the statement P(ji) holds for all sufficiently large n, 
where P(n) is a property applicable to positive integers n. On the other 
hand, if for each positive integer i there exists a positive integer j > i such 
that P{j) holds, then we say that P{n) holds for infinitely many values of 
n. 

We say that a sequence^ (a„) of real numbers converges to a real number 
a, called the limit of (a„), if for each £ > 0 there exists a positive integer 
N, depending on £, such that |a — a„| < e whenever n > N. Thus (a„) 
converges to a if and only if for each £ > 0 we have |a — a„| < £ for all 
sufficiently large n. In that case we write 

lim On = a 

n— >-C30 

or 

a„ — >■ a as n — >■ oo, 

and we also say that a„ tends to a as n — >■ oo. 

On the other hand, we say that (a„) diverges to oo, and we write 

a„ — >■ oo as n — >■ oo, 

if for each AT > 0 we have a„ > K for all sufficiently large n. If for each 
AT > 0 we have a„ < —K for all sufficiently large n, then we say that (a„) 
diverges to — oo, and we write 

a„ — >■ — oo as n — >■ oo. 



(1.2.1) Exercises 

.1 Prove that if (a„) converges to both a and o', then a = a'. (Show 
that \a — a'l < £ for each £ > 0. This exercise justifies the use of the 
definite article in the phrase “the limit of (a„)”.) 



®We can extend the definitions of convergence and divergence of sequences in 
the obvions ways to cover families of the form {a„)n>v, where € Z; all that 
matters is that a„ be defined for all sufficiently large positive integers n. This 
observation makes sense of the last part of Proposition (1.2.2), where we discuss 
the limit of a quotient of two sequences. 
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.2 Let c > 0. Prove that (a„) converges to a if and only if for each 
£ > 0 there exists a positive integer N, depending on e, such that 
|a — a„| < c£ for all n> N. 

.3 Prove that if a sequence (a„) converges to a limit, then it is hounded, 
in the sense that there exists c > 0 such that |a„| < c for all n. 

.4 Let r G R, and let (a„) be a convergent sequence in R such that 
lim„_>oo a„ > r. Prove that On> r for all sufficiently large n. 

.5 Let r G R, and let (a„) be a convergent sequence in R such that 
> r for all sufficiently large n. Prove that lim„_>oo a„ > r. 

.6 Prove that if (a„) diverges to infinity and (&„) converges to a limit 
5 G R, then the sequence (a„ + &„) diverges to infinity. 

The process of taking limits of sequences preserves the basic operations 
of arithmetic. 

(1.2.2) Proposition. Let (a„) and (&„) he sequences of real numbers 
converging to limits a and h, respectively. Then as n — >■ oo, 

Qn hji — y a b, 

a„ - b„ ^ a - b, 
o^nbn ^ ab, 

max{a„, 6„} — >■ max {a, b} , 
min {a„, 6„} — >■ min {a, b} , and 
Wn\ |a| ■ 

If also b ^ 0, then 0 for all sufficiently large n, and anjbn — >■ a/6 as 
n — >■ oo. 

Proof. We prove only the last statement, leaving the other cases to 
Exercise (1.2.3: 1). 

Assume that 6 0. Then, by Exercise (1.2.1: 4), there exists Nq such that 

|6„| > \ \h\, and therefore anjbn is defined, for all n > TVq. Given £ > 0, 
choose > A^o such that |a„ — a| < £ and |6„ — 6| < £ for all n > N. For 
all such n we have 

On a _ |6a„ - o6„| 

^"6 \bn\ |6| 

|6(g„ - g) + g(6 - 6„)| 

< 2 |6|“^ (|6| |g„ - g| + |a| \b - 6„|) 

<2|6|-^(|g| + |6|)£. 
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The result now follows from Exercise (1.2.1: 2). □ 

(1.2.3) Exercises 

.1 Prove the remaining parts of Proposition (1.2.2). 

.2 Prove that if fc > 2 and ly > 1 are integers, then 

(1 + i) > i (1 + l) ^ ^ 

Hence prove that if 0 < |r| < 1, then r” — >■ 0 as n — >■ oo. (Given 
£ > 0, first choose v such that 1/v < e. Then choose k such that 
|r|“^ > 1 + k~^.) 

.3 Prove that if r > 1, then r" — >■ oo as n — >■ oo. 

.4 Prove that if a > 1, then log^j n — >■ oo as n — >■ oo. 

.5 Prove that if r = lim„_>oo On, then r = limfc_>oo for any 

subsequence (a„^)^;^ of (a„). 

.6 Let (a„) be a sequence of real numbers such that the subsequences 
(« 2 n)))foi and (a 2 n+i)^i both converge to the limit 1. Prove that (a„) 
converges to 1. 

.7 Let (a„) be a sequence in R. Prove that if the three subsequences 
(d 2 n), (o 2 n+i), and (asn) are convergent, then so is (a„). 

.8 Give an example of a sequence (a„) of real numbers with the following 
properties. 

(i) (a„) is not convergent; 

(ii) for each k > 2 the subsequence (akn)^=i is convergent. 

(Split your definition of a„ into two cases — one when n is prime, the 
other when n is composite.) 

When we apply notions such as bounded above, supremum, and infimum 
to a sequence (s„) of real numbers, we are really applying them to the set 
{sn : n > 1} of terms of the sequence. Thus the supremum (respectively, 
infimum) of a majorised (respectively, minorised) sequence (s„) is denoted 
by sup„> 2 ^ s„, or just sups„ (respectively, inf„>i s„, or just inf s„). 

The next result, known as the monotone sequence principle, is a powerful 
tool for proving the existence of limits. 

(1.2.4) Proposition. An increasing majorised sequence of real numbers 
converges to its least upper bound; a decreasing minorised sequence of real 
numbers converges to its greatest lower bound. 
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Proof. Let (s„) be an increasing majorised sequence of real numbers, and 
s its least upper bound. For each £ > 0, since s — £ is not an upper bound 
of (s„), there exists N such that > s — e. But (s„) is both increasing 
and bounded above by s; so for all n > we have s — £ < s„ < s and 
therefore |s — s„| < £. Since £ > 0 is arbitrary, it follows that s„ — >■ s as 
n — >■ oo. 

The case of a decreasing minorised sequence is left as an exercise. □ 

(1.2.5) Exercises 

.1 Prove the second part of the last proposition in two ways. 

.2 Prove that an increasing sequence of nonnegative real numbers 
diverges to infinity if and only if it is not bounded above. 

.3 Let a > 1 and a; > 0. Prove that there exists an integer m such 
that < X < (First take a: > 1, and consider the sequence 

.4 Discuss the convergence of the sequence (a„) defined by a„+i = 
yTolf, where a\ and r are positive numbers. 

.5 Prove that if 0 < a and A: G N, then lim„_>oo = 1- (First 

consider the case where fc = 0 and 0 < a < 1. Apply the monotone 
sequence principle to show that the sequence ( converges to 
a limit 1. By considering the subsequence ( ^^/a) , show that ^/l = 1.) 

.6 Prove that if (a„) is a sequence of positive numbers such that 

? = lim ^ 

n-s-oo o„ 

exists, then lim„_>oo A/on = I- By considering the sequence 
1, a, ah, a?b, , a^h^,a^b ^ , . . . , 

where a, b are distinct positive numbers, show that the converse is 
false. 

.7 Prove that if n > 2, then (n + 1)” < Use this to 

show that I = lim„_>oo exists. By considering the subsequence 

( , prove that ^ = 1. Hence show that if a > 1, then 

lim„_>oo (n“Mog„n) = 0. 

.8 Prove that the sequence ((l + is convergent. (An interest- 

ing proof of this result, based on the well-known inequality involving 
arithmetic and geometric means, is found in [32].) 
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.9 Let (a„) be a sequence of real numbers. If (a„) is bounded above, 
then its upper limit, or limit superior, is defined to be 

lim sup a„ = inf sup{a„ , a„+ 1 , a „+2 , . . . } 

n>l 

if the infimum on the right exists. Prove that a real number s equals 
lim sup a„ if and only if for each e > 0, 

— a„ < s + e for all sufficiently large n, and 
— On > s — e for infinitely many values of n. 

Prove also that 

limsupa„= lim sup{a„, a„+i, a„+ 2 , . . .}. 

n—^oo 

.10 If (a„) is bounded below, then its lower limit, or limit inferior, is 
defined to be 



lim inf a„ = supinf{a„, a„+i,a„+ 2 , . . .} 

n>l 

if the supremum on the right exists. Establish necessary and sufficient 
conditions for a real number I to equal lim inf a„. 

.11 Prove that a„ — >■ a € R as n — >■ oo if and only if 

lim inf a„ = a = lim sup a„ . 



A sequence (Sn)^-i of subsets of R is said to be nested, or descending , 
if S'! D 52 S'a D • • • . We make good use of the following nested intervals 
principle. 

(1.2.6) Proposition. The intersection of a nested sequence of closed 
intervals in R is nonempty. 

Proof. Let ([a„, &„]) be a nested sequence of closed intervals in R. Then 

ai <a„< a„+i < b„+i < &„ < 6i (1) 

for each n. By Proposition (1.2.4), (a„) converges to its least upper bound 
a, and (6„) converges to its greatest lower bound b. It follows from the 
inequalities (1) and Exercise (1.2.1: 5) that a < &. So for each n, Un < a < 
b < bn and therefore a € [a„, b„]. □ 

The following elementary lemma leads to simple proofs of several 
important results in analysis. 

(1.2.7) Lemma. If (an) is a sequence of real numbers, then at least one 
of the following holds. 
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(i) (a„) has a constant subsequence; 

(ii) (an) has a strictly increasing subsequence; 

(iii) (an) has a strictly decreasing subsequence. 

Proof. Suppose that (a„) contains no constant subsequence, and con- 
sider the set 

S = {n € :\/k >n (a„ > Ofe)} . 

If S is bounded, then there exists N such that 

\/n > N 3k > n (au > an ) , 

and a simple inductive construction produces positive integers N < n\ < 
U 2 < ■ ■ ■ such that > o„j, for each k. If, on the other hand, S is 

unbounded, then we can compute ni < n 2 < • • • such that a„j, > for 

each k. In that case, since contains no constant subsequence, for 

each k there exists j > k such that a„j, > a„^. ; it is now straightforward to 
construct a strictly decreasing subsequence of (on^)- □ 

(1.2.8) Corollary. A bounded sequence of real numbers has a convergent 
subsequence. 

Proof. This follows from Lemma (1-2.7) and the monotone sequence 
principle. □ 

A sequence (a„) of real numbers is called a Cauchy sequence if for each 
£ > 0 there exists a positive integer N, depending on e, such that \am—an\ < 
e for all m,n > N. 

(1.2.9) Exercises 

.1 Prove that a convergent sequence of real numbers is a Cauchy 
sequence. 

.2 Prove that a Cauchy sequence is bounded. 

.3 Prove that if a Cauchy sequence (a„) has a subsequence that 
converges to a limit a G R, then (a„) converges to a. 

.4 Let (on) be a bounded sequence each of whose convergent subse- 
quences converges to the same limit. Prove that (a„) converges to 
that limit, (cf. Exercises (1.2.3: 6 and 7). By Corollary (1.2.8), there 
is a subsequence (a„j.) that converges to a limit 1. Suppose that (a„) 
does not converge to I, and derive a contradiction.) 
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One of the most important results in convergence theory says that not 
only does a Cauchy sequence of real numbers appear to converge, in that 
its terms get closer and closer to each other as their indices increase, but 
it actually does converge. 

A subset S' of R is said to be complete if each Cauchy sequence in S 
converges to a limit that belongs to S. 

(1.2.10) Theorem. R is complete. 

Proof. Let (a„) be a Cauchy sequence in R. Then (a„) is bounded, 
by Exercise (1.2.9: 2). It follows from Corollary (1.2.8) that (a„) has a 
convergent subsequence; so (a„) converges, by Exercise (1.2.9: 3). □ 



(1.2.11) Exercises 

.1 Find an alternative proof of the completeness of R. (Given a Cauchy 
sequence (a„) in R, consider liminfa„.) 

.2 Show that if, in the system of axioms for R, the least-upper- 
bound principle is replaced by the Axiom of Archimedes (Exercise 
(1.1.1:16)), then the nested intervals principle is equivalent to the 
completeness of R. Can you spot where you have used the Axiom of 
Archimedes? 



.3 Under the conditions of the preceding exercise, show that the least- 
upper-bound principle follows from the completeness of R. (Assum- 
ing that R is complete, consider a nonempty majorised subset S of 
R. Choose Si € S' and bi £ B, where B is the set of upper bounds 
of S. Construct a sequence (s„) in S and a sequence (6„) in B such 
that 

^ S ^n+1 ^ 






Prove that 0 < — 6m < 2 ”+^(6i — si) whenever m> n, that (s„) 

and (6„) converge to the same limit 6, and that 6 = supS.) 



.4 Prove Cantor’s Theorem: if (a„) is a sequence of real numbers, then 
in any closed interval of R with positive length there exists a real 
number x such that a; a„ for each n. (For each x G R and each 
nonempty S C R, define the distance from x to S to be the real 
number 

p{x, S) = inf{|x — s| : s G S}. 



First prove the following lemma. If / = [a, 6] is a closed interval 
with positive length, and Ji, J 2 , ^3 are the left, middle, and right 
closed thirds of /, then for each real number x either p(x, Ji) > 0 
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or p{x,J^) > 0. Use this lemma to construct an appropriate nested 
sequence of closed intervals. 

This argument is a refined version of the “diagonal argument” first 
used by Cantor. An interesting analysis of Cantor’s proof, and of the 
misinterpretation of that proof over the years, is found in [19].) 

.5 Prove that R\Q is order dense in R. 

The study of infinite series, a major part of analysis in the eighteenth 
and nineteenth centuries (see [27]), still provides interesting illustrations of 
the completeness of R. 

Let be a sequence of real numbers. The real number 

k 

^ ^ (kji 

n—1 

is called the fcth partial sum of the series Formally, we define 

the series with nth term a„ to be the sequence (si, S 2 , ...) of its 

partial sums. The sum of that series is the limit s of the sequence (s„), if 
that limit exists, in which case we say that the series is convergent, or that 
it converges to s, and we write 



n—1 

We use analogous notations and definitions for the series associated with 
a family (a„))]T,^ of real numbers indexed by {n G Z : n > n} , where v is 
an integer, and for the series associated with a family (on)„g2 

indexed by Z. We commonly write ^^r the series when it 

is clear that the indexing of the terms of the series starts with n. 

The completeness of R is used in the justification of various tests for the 
convergence of infinite series. These tests are useful because they enable us 
to prove certain series convergent without finding explicit values for their 
sums. For example, a number of convergence tests easily show that the 
series convergent; but it is considerably harder to show that 

the sum of this series is actually tt‘^/6 (Exercise (5.2.12:7); see also [31]). 
We begin with the comparison test. 

(1.2.12) Proposition. If^^=ibn is a convergent series of nonnegative 
terms, and z/ 0 < a„ < for each n, then converges. 

Proof. Let b be the sum of the series bn- Then for each N we have 

N Af+1 N+l 

^ ^ o^n — 'y ^ O.JI ^ y ] bji ^ b, 

n—1 n—1 n—1 
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SO the partial sums of form an increasing majorised sequence. It 

follows from the monotone sequence principle that converges. □ 

(1.2.13) Proposition. // (a„) is a decreasing sequence of positive num- 
bers converging to 0, then the alternating series converges 

(Leibniz’s alternating series test). 

Proof. For each k let 

k 

Sk = 

n— 1 

Then 

S2k+2 — S2k = 02fc+l — a,2k+2 > 0 

and 

fll ~ S2k = (Q2 ~ 03 ) + + (02fe-2 ~ 0,2k-l) + 0.2k > 0- 

So the sequence (s 2 fc)^i is increasing and bounded above; whence, by the 
monotone sequence principle, it converges to its least upper bound s. Now, 

|s — S2m+l| = |s ~ S2m ~ a2m+l| < |s ~ S2m\ + 02m+l- 

Also, both |s — S 2 m\ and a 2 m+i converge to 0 as m — >■ oo. It follows that if 
£ > 0, then |s — S 2 m| < £ and |s — S 2 m+i| < £ for all sufficiently large m. 
Hence converges to s, by Exercise (1.2.3: 6). □ 

(1.2.14) Exercises 

.1 Prove that if the series ^ a„ converges, then lim„_>oo a,„ = 0. By 
considering ^1 sfo-, or otherwise, show that the converse is false. 

.2 Prove the comparison test using the completeness of R, instead of 
the least-upper-bound principle. 

.3 A series of nonnegative terms is said to diverge if the corresponding 
sequence (s„) of partial sums diverges to infinity. Prove the limit 
comparison test: If (a„) and (6„) are sequences of positive numbers 
such that 

lim ^ > 0, 

n— >oo 

then either ^ a„ and ^ b„ both converge or else they both diverge. 

.4 Prove that if |r| < 1, then the geometric series r” converges 

and has sum 1/ (1 — r) . What happens to the series if |r| > 1? 

.5 Let & > 2 be an integer, and x £ [0,1]. Show that there exists a 
sequence (a„) of integers such that 
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(i) 0 < o„ < & for each n, and 

(ii) a; = 

Show that this sequence (a„) is uniquely determined by x unless there 
exist k,n € N such that x = kb~^, in which case there are exactly 
two such sequences. 

Conversely, show that if (an) is a sequence of integers satisfying 
(i), then converges to a sum x in [0,1]. (The series 

is called the b-ary expansion of x, or the expansion of x 
relative to the base b. If b = 2, the series is the binary expansion of 
X, and if 6 = 10, it is the decimal expansion.) 

.6 Prove that 



(i) is divergent if p < 1; 

(ii) is convergent. 

(For (i), first prove the divergence of l/ai by considering the 

2’^ 

partial sums 1/?^^ for iV = 1, 2, ) 

.7 Prove that the series 



- + - + - 
9 ^ 19 ^ 29 



H h 



89 



+ 




91 



H h 





1 

119 



+ • • • , 



where each term contains the digit 9, diverges; and that the series 



1+5 + 1 





where no term contains the digit 9, converges. (Thus the divergent 
series X))0Li 1/^ can be turned into a convergent one by weeding out 
all the terms that contain the digit 9. For a discussion of this and 
related matters, see [3].) 

.8 Prove that if p > 2, then the series 1/^^ is convergent. 

.9 Prove d’Alembert’s ratio test: let '^Onhe & series of positive terms 
such that 

? = lim ^ 

n-s-oo an 

exists; then X) converges if I < 1, and diverges if I > 1. (In the 
first case, choose r G (/, 1) and N such that 0 < a„+i < ra„ for all 
n > N.) 

Give examples where I = 1 and (i) X) converges, (ii) X) diverges. 
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.10 Prove that converges and has sum < 3. Show also that 

n n 

i:A-4<(i+h”<i:A 

fc =0 fc =0 

for all n > 3, and hence prove that lim„_>oo (l + l/^-- 

.11 Prove Cauchy’s root test: let ^a„ be a series of positive terms, and 

I = lim sup 

then ^ a„ converges if / < 1 , and diverges if / > 1 . 

.12 Discuss the convergence of the series 

2 3 ^ 32 ^ 33 ■ 

What does this series and Exercise (1.2.5: 6) tell you about the relative 
strengths of the ratio test and the root test? 



.13 Let (a„) be a decreasing sequence of nonnegative real numbers, and 
for each N let 



= ^ On, tN = ^ 2" 



Show that 



(i) if m < 2^ , then Sm < t]^, and 

(ii) if TO > 2^, then Sm > \tM- 

Hence prove that converges if and only if 2”a2" 

converges. 

.14 Use the preceding exercise to show that 1/''^^ converges if and 

only if p > 1 (cf. Exercises (1.2.14: 6 and 8)). 

.15 Let (an)((UQ and (6n)((Uo be sequences of real numbers, and for each 
N write Sn = '^n=o Show that if fc > j, then 

k k — 1 

^ ^ ^ ^ Sfi{bn ^n+l) — 

n^j n^j 

Now suppose that 

(i) there exists M > 0 such that |5n| < M for all n, 

(ii) hn> ^n+i for each n, and 
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(iii) lim„_>oo bn = 0. 

Prove that if k > j, then X)n=j < 2Mb j, and hence that 

^nbn converges. Use this result to give another proof of 
Leibniz’s alternating series test. 

A series ^ a„ of real numbers is said to be absolutely convergent if ^ |a„| 
is convergent. 

(1.2.15) Proposition. An absolutely convergent series is convergent. 

Proof. Let X)®" be absolutely convergent. Since the partial sums of 
X knl form a Cauchy sequence, for each e > 0 there exists N such that 

k 3 k 

^ ' l®n| ~ ^ ^ ' l®n| ^ ^ 

n—1 n—1 n—j 

whenever k > j > N. For such j and k we have 

k j k k 

^ ^ ^ ^ ^ ^ ^ ^ ^ |^n| 

n—1 n—1 'f^—j 'f^—j 

Thus the partial sums of X form a Cauchy sequence; whence X is 
convergent, by the completeness of R. □ 

The case p = 1 of Example (1.2.14:6) shows that the converse of 
Proposition (1.2.15) is false. 

By a power series we mean a series of the form X^o where the 

coefficients a„ G R. Such a series always converges for a; = 0, but it may 
converge for nonzero values of x. Its radius of convergence is defined to be 

{ OO 

r > 0 : On a:” converges whenever |a;| < r 

n=0 

if this supremum exists, and oo otherwise; and its interval of convergence 
is the largest interval I such that the power series converges for all x G /. 
It is an immediate consequence of Exercise (1.2.16: 10) that every power 
series has both a radius and an interval of convergence. 

(1.2.16) Exercises 

.1 Find an alternative proof of Proposition (1.2.15). 

.2 Prove that the series X^i converges absolutely if — 1 < r < 1. 
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.3 Let X) X be convergent series of nonnegative terms, with sums 
a, b, respectively, and let 

Un = OlX + 02^n-l + ’ ’ ’ + an-l&2 + Qn^l- 



Prove that 

^ (^ \(^ \ ^ 

2_^Un< \ Z^a-n \ 2^ On < > , '»2n 

n—1 \n—l / \n—l / n—1 

and hence that X '^n converges to the sum ab ( Cauchy ’s theorem on 
the multiplication of series). Extend this result to the case where the 
terms a„ may not be nonnegative but X ®n is absolutely convergent. 
(Writing 

OO 

b = X ^n, 

n—1 

k 

I3k = b- bn, 

n—1 

show that 

N N N 

^ ^ a„ + ^ Qk^N-k, 

n—1 n—1 k—0 

and hence that X^o ^kf^N-k — >■ 0 as fV — >■ oo.) 

.4 Show that (~1)” /\/n + 1 converges, but that the product (as 

in the preceding exercise) of this series with itself does not converge. 

.5 Prove that the exponential series 

exp(a:) = X] 

n\ 

n—O 

converges absolutely for all a: G R. Then prove that 
exp(x + y) = exp(a;) exp(y). 



.6 Prove that exp(x) = e“, where e = exp(l). (Use Exercise (1.1.4: 5).) 
Show that 

^ n! ^ (iV + 1)! 

n—0 

for each N, and hence calculate e with an error at most 10“®. 

.7 Prove that e is irrational. (Suppose that e = p/q, where p 
and q are positive integers. Choose N > max {<7,3}, show that 
i® integer, and use the inequality from the 
preceding exercise to deduce a contradiction.) 
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.8 Show that 

e“= lim 

n— >oo V n/ 

for each a; € R. (First take x > 0. Expand s„ = (1 + xfn)^ using the 
binomial theorem, and use the monotone sequence principle.) 

.9 For each n define 



7n = l + 5 + |H h^-logn 

Show that 



for each n, and hence that the sequence (q^) is decreasing and 
bounded below. It follows from the monotone sequence principle that 
Euler’s constant 

7 = lim 7 „ 

n—^oo 

exists. Show that 



N 



E 



(_l)n+l 

n 



= 12N ~1N + log 2 



and hence that 

(_l)n+i 

n 

(cf. Exercise (1.2.14: 6).) 



E 



log 2. 



.10 Let r > 0. Prove that if converges for x = r, then it 

converges absolutely whenever |x| < r; and that if this power series 
diverges for x = r, then it diverges whenever |x| > r. (For the first 
part, show that there exists M > 0 such that |a„x”| < M |x/r|" for 
all n.) 



.11 Find the radius of convergence and the interval of convergence for 
.12 Find the radius of convergence and the interval of convergence for 
.13 Suppose that a„ yf 0 for all n, and that 

On+l _ . 

a„ 

Show that if / = 0, then converges for all x G R; and that 

if I yf 0, then the series has radius of convergence l/l. 



lim 

n—¥oo 
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.14 Let (an)^o be a bounded sequence of real numbers, and let 

I = lim sup ly |a„|. 

Show that if / = 0, then converges for all x G R; and that 

if I yf 0, then the series has radius of convergence l/l. 

.15 Prove that if ( i/|a„|)^Q is an unbounded sequence, then the power 
series only converges for a; = 0. 

.16 Prove that the power series 

OO OO OO 

a„a;", na„a;"“^, and y^ 

n—0 n—1 n—0 

have the same radius of convergence. Need they have the same interval 
of convergence? 

By a rearrangement of an infinite series mean a series of 

the form ®/(n) where / is a permutation of (that is, a one-one 

mapping of N"*" onto itself). A theorem first proved by Riemann shows 
that if ^ a„ is a convergent, but not absolutely convergent, series of real 
numbers, then for each real number s there exists a rearrangement of ^ an 
that converges to s. The second exercise in the next set leads you through 
a proof of this remarkable result. 

(1.2.17) Exercises 

.1 Prove that if ^ a„ is absolutely convergent, with sum s, then any 
rearrangement of ^ a„ converges to s. (Given a permutation / of 
and a positive number s, choose N such that l®«l ^ Then 

choose M > N such that {1,2, ...,A^} C {/(I), /(2), . . . , /(M)} . 
Show that |X)r=i ®/(n) ~ < 2e for all m > M.) 

.2 Let be an infinite series of real numbers that converges but is 

not absolutely convergent. For each n define a+ = max{a„, 0} , a“ = 
min{a„,0} . Prove that the partial sums of the series 

form increasing unbounded sequences. Now let s be any real 
number. Let no and toq both equal 0, and let n\ be the least positive 
integer such that 

+ 1" > S- 

Show how to construct strictly increasing sequences (nfc)^g and 
(mfe)^Q of positive integers such that for each N > 0, 

N-l 

X! ((“";= + l ^ ^ + l H ^ «m;c+l)) < ® 
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and 

J2k=0 ((“^fc + l ^ <^tk+l) + + l H ^ «m;,+i)) 



+ 



(a 



+ 

riiv + l 



"AT+l 



> S. 



Hence obtain a rearrangement of that converges to s. 

.3 Let s be the sum of the series Show that the series 




converges to |log2. 



1.3 Open and Closed Snbsets of the Line 

In this section we introduce the fundamental topological notions of “open 
set” and “closed set” in R, notions that readily generalise in later, more 
abstract contexts. 

A subset A of R is said to be open (in R) if to each x G A there 
corresponds r > 0 such that the open interval {x — r, x + r) is contained in 
A — or, equivalently, such that y G A whenever \x — y\ < r. 

(1.3.1) Exercises 

.1 Prove that R itself, the empty set 0, and all open intervals are open 
subsets of R. 

.2 Give an example of a sequence of open subsets of R whose intersection 
is not open. 



The first result in this section describes the two fundamental properties 
of open sets. 

(1.3.2) Proposition. The union of any family of open sets is open. The 
intersection of any finite family of open sets is open. 

Proof. Let {Ai)i^j be any family of open sets. If x belongs to the union 
U of this family, then x G Ai for some i. As Ai is open, there exists r > 0 
such that 

{x — r,x + r) C Ai <ZU. 



Hence U is open. 
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Now let Ai, . . . , An be finitely many open sets, and consider any x in 
their intersection. For each i, since x G Ai and Ai is open, there exists 
Xi > 0 such that y G Ai whenever \x — y\ < Let 

r = min{ri, . . . ,r„} > 0. 

If \x — y\ < r, then y G Ai for each i, so y G nr=i Hence nr=i 
open. □ 

In view of Exercise (1. 3. 1: 2), we cannot drop the word “finite” from the 
hypothesis of the second part of Proposition (1.3.2). 

Our next aim is to characterise open sets in R; to achieve this, we first 
characterise intervals. 

A nonempty subset S of R is said to have the intermediate value property 
if (a, b) C S whenever a G S, b G S, and a < 6. Of course, as we show in 
Section 4, this notion is connected with the Intermediate Value Theorem 
of elementary calculus. 

(1.3.3) Proposition. A subset S of H has the intermediate value 
property if and only if it is an interval. 

Proof. It is clear that every interval in R has the intermediate value 
property. Conversely, suppose that S' C R has that property. Assume, to 
begin with, that S is bounded, and let a be its infimum and b its supremum. 
Note that x ^ S if either x < a or a; > &. If a and b both belong to S, then 
by the intermediate value property, so does every point of [a,b]; whence 
S = [a, 6]. If a G S and b ^ S, consider any x such that a < x < b. By 
the definition of “supremum”, there exists s G S such that a < x < s; the 
intermediate value property now ensures that x G S; whence S = [a, 6). 
Similarly, if a ^ S and b G S, then S = (a, b]. The remaining cases are left 
as exercises. □ 

(1.3.4) Exercises 

.1 Prove that a nonempty open subset of R with the intermediate value 
property is an open interval. 

.2 Complete the proof of Proposition (1.3.3) in the remaining cases. 

.3 Let I, J be open intervals with nonempty intersection. Prove that 
/ U J and / n J are open intervals. 

(1.3.5) Lemma. A nonempty family of pairwise- disjoint open intervals 
of-R is countable. 

Proof. Let .7^ be a nonempty family of pairwise-disjoint open intervals 
in R, and note that, by Exercise (1.1.1: 19), each of these intervals contains 
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a rational number. The Axiom of Choice (see Appendix B) ensures that 
there is a function / : .7^ — >• Q such that /(/) G I for each I G T. Since the 
sets in T are pairwise disjoint, / is one-one and so has an inverse function 
g mapping f{J^) onto T. As Q is countable and C Q, there exists a 
mapping h of N'*' onto the composite function g o h then maps 

onto IF, which is therefore countable. □ 

(1.3.6) Proposition. A nonempty subset of R is open if and only if it 
is the union of a sequence of pairwise- disjoint open intervals. 

Proof. It follows from Proposition (1.3.2) and Exercise (1.3.1: 1) that the 
union of any family of open intervals is an open set. Conversely, given a 
nonempty open subset S of R, define a binary relation ~ on S' by setting 
cc ~ j/ if and only if there exists an open interval / C S such that x,y G 
I. Then ~ is an equivalence relation: it is straightforward to prove the 
reflexivity and symmetry of and its transitivity follows from Exercise 
(1.3.4: 3). Clearly, the equivalence class i of a; under ~ is a union of open 
intervals and is therefore an open set. Consider points y, z G x and a real 
number t with y < t < z. Choosing open intervals Iy,Iz C S such that 
x,y G ly and x,z G Iz, we see from Exercise (1.3.4: 3) that ly U Iz is an 
open interval; so t G {y, z) C ly U Iz, and therefore either x,t G ly C S or 
else x,t G Iz C S. Hence x has the intermediate value property. It follows 
from Exercise (1.3.4: 1) that x is an open interval. Since any two distinct 
equivalence classes under ~ are disjoint, we now see that 

u * 

x^S 

is a union of pairwise-disjoint open intervals. Reference to Lemma (1.3.5) 
completes the proof. □ 

A real number x is an interior point of a set S' C R if there exists r > 0 
such that {x — r, x + r) C S. The set of all interior points of S is called the 
interior of S, and is written S° . By a neighbourhood of x we mean a set 
containing x in its interior. 

(1.3.7) Exercises 

.1 Let S be a nonempty open subset of R, and for each x G S consider 
the sets 



= {t G R : {x,f) C S}, 

Aa; = {s G R : (s, a;) C S} . 

Let a = mi Lx, b = supCA, and Ix = (a, 6), where a = —oo if Lx 
is not bounded below, and & = oo if Ca, is not bounded above. Give 
another proof of Proposition (1.3.6) by showing that {Ix)xsS is a 
family of disjoint open intervals whose union is S. 
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.2 Prove that the interior of an open, closed, or half open interval with 
endpoints a and b, where a < b, is the open interval (a, b). 

.3 Show that {S°)° = S°. 

.4 Prove that S° is the largest open set contained in S — in other words, 
that 

(i) S° is open and S° C S; 

(ii) if A is open and A C S', then A C S°. 

.5 Prove that S is open if and only if S C S°. 

.6 Prove that S° is the union of the open sets contained in S. 

.7 Prove that 

(i) if S C T, then S° C T°; 

(ii) (SnT)° = S°nT°. 

.8 Prove that U is a, neighbourhood of x G R if and only if there is an 
open set A such that x G A c U . 

Let X be a real number, and S a subset of R. We call x a cluster point 
of S if each neighbourhood of x has a nonempty intersection with S; or, 
equivalently, if for each £ > 0 there exists y € S such that \x — y\ < e. The 
closure of S (in R) is the set of all cluster points of S, and is denoted by 
S or S~ . S is said to be closed if S = S. 

(1.3.8) Exercises 

.1 Is Q closed in R? Is it open in R? 

.2 Show that the closure of any interval with endpoints a and b, where 
a < &, is the closed interval [a,b], 

.3 Show that (S') = S. 

.4 Prove that S is the smallest closed set containing S — in other words, 
that 

(i) S is closed and S C S; 

(ii) if A is closed and S C A, then S C A. 

.5 Prove that S is closed if and only if S C S. 

.6 Prove that S is the intersection of the closed sets containing S. 
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.7 Prove the following. 

(i) If S' C r, then S CT; 

(ii) SJJT = SUT. 

.8 Prove that 

(i) the complement of S° is the closure of R\S; 

(ii) the complement of S is the interior of R\S. 

.9 The boundary, or frontier, of a set S C R is the intersection of the 
closures of S and R\S. Describe the boundary of each of the following 
sets: R, 0, (a, 6] (where a < b), Q. 

.10 Prove that a belongs to the boundary of S\{a} if and only if a G 

.11 Let C be the Cantor set — that is, the subset of [0, 1] consisting of all 
numbers that have a ternary (base 3) expansion with 

a„ G {0,2} for each n. Prove that 

(i) if a, b are two numbers in C that differ in their mth ternary 
places, then |a — 6| > 3“™; 

(ii) C is a closed subset of R; 

(iii) C has an empty interior. 

What is the boundary of Cl 



(1.3.9) Proposition. S is closed if and only z/R\S' is open. 

Proof. Suppose that S is closed, and consider any x G RXS”. Since S = S, 
X is not a cluster point of S'; so there exists a neighbourhood U oi x that 
is disjoint from S. By Exercise (1.3.7: 8), there is an open set A such that 
X G A C U. Then A fl S = 0, so ^ C R\S; whence, by Exercise (1.3.7: 8), 
R\S is a neighbourhood of x, and therefore x G (R\S)°. Since x is any 
element of R\S, we conclude that R\S is open. 

Conversely, suppose that R\S is open. Then by Exercise (1.3.7: 8), R\S 
is a neighbourhood of each of its points. Since R\S is disjoint from S, it 
follows that no point of R\S' is in the closure of S. Thus if a; G S', then 
X ^ R\S and so x £ S. Hence S C S, and therefore, by Exercise (1.3.8: 5), 
S is closed. □ 

(1.3.10) Proposition. The intersection of a family of closed sets is 
closed. The union of a finite family of closed sets is closed. 
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Proof. Let {Ci)i^i be any family of closed sets, and for each i let Ai be 
the complement of Ci. Then 

f|a = R\(U^0- 

ie/ iG/ 

Since, by Proposition (1.3.9), each Ai is open. Proposition (1.3.2) shows 
that Uie/ open; whence, again by Proposition (1.3.9), its complement 
is closed. This completes the first part of the proof; the second is left as an 
exercise. □ 

(1.3.11) Exercises 

.1 Complete the proof of Proposition (1.3.10). 

.2 Give an example of a sequence of closed sets whose union is not 
closed. 

Which subsets of R are both open and closed? Before answering this 
question, we prove a simple lemma. 

(1.3.12) Lemma. If T is a nonempty open subset o/ R that is hounded 
above (respectively, below), then supT ^ T (respectively, infT ^ T). 

Proof. Consider, for example, the case where T is bounded above. Sup- 
pose that M = supT belongs to T. Since T is open, (M — r, M -|- r) C T 
for some r > 0. Hence M + ^r G T, which is absurd as M + \r > supT. 
Hence, in fact, M ^T. □ 

(1.3.13) Proposition. R and 0 are the only subsets o/R that are both 
open and closed in R. 

Proof. Exercise (1.3.1: 1) and Proposition (1.3.9) show that R and 0 are 
both open and closed in R. Let S' be a nonempty set that is both open and 
closed, and note that, by Proposition (1.3.9), R\S is also both open and 
closed. Suppose R\S is nonempty. Choosing a G S and b G R\S, we have 
either a < b or a > b. Without loss of generality we take the former case, 
so that 

T = {x G R\S : X > a} 

is nonempty and bounded below. By Proposition (1.3.2), T is also open, 
being the intersection of the open sets (a, oo) and R\S. Let m = inf T. 
Since S is open, there exists r > 0 such that (a — r, a-|-r) C S; whence 
m > a-l-r > a. Since, by Lemma (1.3.12), m ^ T, it follows that m ^ R\S' 
and therefore that m G S. But S is open, so there exists e > 0 such that 
(m — e,m + e) C S; this is impossible, since the definition of “infimum” 
ensures that there exists t G R\S' such that t < m + s. This contradiction 
shows that R\S' is empty; whence S' = R. □ 
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Let I be an interval in R, a a point of the closure of I, and / a real-valued 
function whose domain includes / but not necessarily a. A real number I is 
called the limit of f{x) as x tends to a in I, or the limit of f at a {relative 
to I), if to each e > 0 there corresponds i5 > 0 such that \f{x) — l\ < e 
whenever x G I and 0 < |x — a| < <5. We then write 

f{x) I as X ^ a, X G I 



lim f{x) = I 

x—¥a, x^I 

and we say that f{x) tends to I as x tends to a through values in I. 
The following are the most important cases of this definition. 

• a G 1°: in this case we use the simpler notations 

f{x) -G I as X ^ a 



and 

lim f{x) = 1. 

x—^a 

• I = {c, a) for some c < a (where c could be — oo): in this case we call I 
the left-hand limit oi f as x tends to a; we say that f{x) tends to I as 
X tends to a from the left (or from below); and we use the notations 

f{x) -G I as X ^ a~ 

and 

f{a~) = lim f{x) = 1 . 

x—¥a~ 

• I = {a, b) for some b > a (where b could be oo): in this case we call 
I the right-hand limit of / as x tends to a; we say that f{x) tends 
to I as X tends to a from the right (or from above); and we use the 
notations 

f{x) I as X ^ a~'~ 

and 

/(a+) = lim /(a;) = L 

x—^a+ 

We stress that although, in our definition of “limit”, f{x) is defined for 
all X in / that are distinct from but sufficiently close to a, f{a) need not 
be defined. For example, in elementary calculus courses we learn that 
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even though (sinx) /a; is not defined at x = 0. 

(1.4.1) Proposition. Iflim.x^a,x&if{x) = l andYivHx^a,x e/ f{x) — I', 
then 1 = 1'. 



Proof. Given e > 0, choose <5/, 5g >0 such that 

— if X G / and 0 < |x — a| < 5f, then |/(x) — l\ < ej2, and 

— if X G / and 0 < |x — a| < 5g, then |/(x) — l'\ < e/2. 

Setting 5 = min {^/, , consider any x G / such that 0 < |x — a| < <5. We 

have 

\l-l'\<\f{x)-l\ + \f{x)-l'\<^^ + l=e. 

Since e > 0 is arbitrary, it follows from Exercise (1.1.2: 4) that 1 = 1'. □ 



(1.4.2) Proposition. f{x) = I and liiaix^a,x€i g{x) = m, 

then as x ^ a through values in I, 

/(x) + g{x) -^l + m, 

/(x) - g{x) 

f{x)g{x) Im, 

max {/(x), (?(x)} — >-max{?,m}, 
min {/(x), g(x)} — >■ min{l,m}, 



If also m yf 0, then 



lim 

x—^a, X 



fix) 

6/ g{x) 



I 

m 



Proof. See Exercise (1.4.3: 3). □ 

(1.4.3) Exercises 

.1 Define precisely what it means to say that /(x) does not converge to 
any limit as x tends to a through values in I. In other words, give the 
formal negation of the definition of “convergent” . 

.2 Let the function / be defined in an interval whose interior con- 
tains a. Prove that lim^^-j-o /(x) = I if and only if lim 3 ,_,.£,+ /(x) and 
lim 2 ,_,.jj- /(x) exist and equal 1. 

.3 Prove Proposition (1.4.2). 

.4 Use the definition of “limit” to prove that lim^_j^^ 
for any polynomial function p and any a G R. 
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.5 Let p, q be polynomial functions, and a a real number such that q{a) yf 
0. Prove that 

lim 

x^a,x€R.q{x) q{a)' 

.6 Prove that lima,_>a_ f{x) = I if and only if for each sequence (a„) 
of elements of / that converges to a, the sequence (/(a„)) converges 
to 1. (For “only if”, use a proof by contradiction.) 

.7 Suppose that f{x) > r. Prove that there exists <5 > 0 such 

that if a; G / and 0 < |cc — a| < i5, then f{x) > r. 

.8 Let / be a real-valued function whose domain includes an interval 
of the form (s,oo), and let I G R. We say that f{x) tends to I as 
X tends to oo if to each £ > 0 there corresponds K > 0 such that 
\f{x) — l\ < s whenever x > K; we then write 

f{x) I as X ^ oo 



or 

lim f{x) = 1. 

x—^oo 

Convince yourself that analogues of Propositions (1-4.1) and (1.4.2) 
hold for limits as x tends to oo. 

Define the notion f{x) tends to I as x tends to — oo, written 
f{x) I as X ^ — oo 



or 

lim f{x) = I, 

x—^ — oo 

and convince yourself that analogues of Propositions (1.4.1) and 
(1.4.2) hold for this notion also. 

.9 Formulate definitions of the following notions, where / is an interval. 

(i) f{x) oo as X a through values in I. 

(ii) f(x) — >■ — oo as a; — >■ a through values in I. 

(iii) f(x) —>• oo as X —>• oo. 

(iv) f(x) oo as X ^ —oo. 

(v) f(x) — >■ — oo as x — >■ oo. 

(vi) f(x) — >■ — oo as x — >■ — oo. 

.10 Prove that if a > 1, then —>■ 0 as x —>■ —oo, and —>■ oo as 

X —>■ oo. What happens to as x —>■ ±oo when 0 < a < 1? (Note 

Exercise (1.2.3: 3).) 
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.11 Prove that if a > 1, then log^j x — >■ 0 as x — >■ — oo, and log^ x — >■ oo as 
X — >■ oo. What happens to log^j x as x — >■ ±oo if 0 < o < 1? 

.12 Let / be a real- valued function, and, where appropriate, define 

lim,c^^ sup /(x) = infr>o sup {/(x) : 0 < |x - ^| < r} , 

lim3;_,.5+ sup /(x) = infr>o sup {/(x) : 0 < x - ^ < J’} , 

lim^,^^- sup/(x) = infr>oSup{/(x) : 0 < ^ - X < r} . 

For example, in order that lim 3 ,_>^- sup /(x) be defined, it is necessary 
that / be defined and bounded"^ on some interval of the form — 
where r > 0. Prove the following. 

(a) lim 3 ,_>j sup /(x) < M if and only if for each e > 0 there exists 

(5 > 0 such that /(x) < M + e whenever 0 < |x — ^| < <5. 

(b) lima;_>j sup /(x) > M if and only if for each pair of positive 
numbers e, 6 there exists x such that 0 < |x — ^| < 5 and /(x) > 
M-e. 

Formulate appropriate definitions of the quantities lima, inf /(x), 
lim 2 ,_,.j+ inf /(x), and lim 2 ,_,.j~ inf /(x). Prove that 

(c) lima; inf /(x) < liiHa-s-^ sup /(x), and these two numbers are 
equal if and only if I = lima;_>^ fi^) exists, in which case the 
numbers equal 1. 

An important special case of the notion of a limit occurs when the 
function / is defined at the point a that we are approaching. 

A function / defined in some neighbourhood of a is said to be continuous 
at a if /(x) — >■ /(a) as x — >■ a; in other words, if for each e > 0 there exists 
5 > 0 such that |/(x) — f{a)\ < e whenever |x — a| < S. We also say that 

/ is 

• continuous on the left at a if f is defined on the interval (a — r, a] for 
some r > 0 and 

f{a~) = lim /(x) = /(a); 

x—^a 

• continuous on the right at a if / is defined on the interval [a, a + r) 
for some r > 0 and 

/(a+) = lim /(x) = /(a); 

x—^a'^ 

^By allowing the lim sup quantities to take the values ±oo, in a sense that is 
made precise in Section 3.1, we can remove the restriction that / be bounded 
near 
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• continuous on the interval I f{x) = f{t) for each t G I. 

Note that the last definition takes care of the one-sided continuity of / at 
those endpoints of /, if any, that belong to I. 

If / is defined in a neighbourhood of a but is not continuous at a, we say 
that / has a discontinuity, or is discontinuous, at a. 

(1.4.4) Proposition. Let the real-valued functions f and g be continu- 
ous at a. Then / + 5, f — g, fg, max {/, g} , min {/, 5} , and \ f\ are contin- 
uous at a. If also g{x) yf 0 for all x in some neighbourhood of a, then f/g 
is continuous at a. 

Proof. This is a simple consequence of Proposition (1.4.2). □ 

(1.4.5) Exercises 

.1 Let / be defined on a neighbourhood of a. Prove that / is continuous 
at a if and only if it is continuous on both the left and the right at a. 

.2 Give the details of the proof of Proposition (1.4.4). Extend this result 
to deal with continuity on an interval I. 

.3 Prove that a polynomial function is continuous on R. 

.4 Let p, q be polynomial functions, and a a real number such that q(a) yf 
0. Prove that the rational function p/q is continuous at a. 

.5 Let / be continuous at a, and let g be continuous at /(a). Prove that 
the composite function g o f is continuous at a. 

.6 Prove that / is continuous at the point a G R if and only if / is 
sequentially continuous at a, in the sense that /(a„) -G f{a) whenever 
(a„) is a sequence of points of the domain of / that converges to a. 

.7 Let / be defined in an interval {a — r, a -\- r) where r > 0. The 
oscillation of f at a is 

w(/. a) = lim sup {f{x) - f{y) : x,y G {a - 5, a + 6)} . 

<5— >0 

Prove that / is continuous at a if and only if w(/, a) = 0. 

.8 Let / be an increasing function on [a, 6]. Prove that f{£,~) exists for 
each f G (a, b], and that f{£f^) exists for each f G [a, 6). By considering 
the sets 

{x G (a,b) : |/(x+) - /(x")| > , 

with n a positive integer, prove that the set of points of [a, b] at which 
/ has a discontinuity is either empty or countable. 
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.9 Let <7o, 9i, • ■ • be a one-one enumeration of Q PI [0, 1], and for each 
X G [0,1] define 

T{x) = {n G N : Qn < x} . 

Define a mapping / : [0, 1] R by 

f 0 if a: = 0 

fix) = < 

I E„eT(.)2-” if0<x<l. 

Prove that 

(i) / is strictly increasing, 

(ii) / is continuous at each irrational point of [0, 1], and 

(iii) / is discontinuous at each rational point of [0, 1]. 

.10 Let (/n)^i be a sequence of functions on an interval I, and suppose 
that there exists a convergent series of nonnegative terms 

such that |/n(a:)| < for each x G I and each n. Prove that for 
each e > 0 there exists N such that 0 < Yf!h=j+i l/«(^)l ^ ^ whenever 
k > j > N and x G I. Hence prove that f{x) = fn{x) defines 

a function on / ( Weierstrass’s M-test). Prove also that if each /„ is 
continuous on I, then so is /. 

.11 Give two proofs that exp is a continuous function on R. 

.12 Prove that if a > 0, then the function a; i— is continuous on R. 
(Note that = exp(xloga).) 

.13 Prove that if a > 0, then the function x i— log^, x is continuous on R. 
(First take the case a > 1. Given a; > 0 and e > 0, choose a positive 
integer n > 1/e, and then 5 G (0, x) such that (x + S) /x < and 
(x — 5) /x > 

.14 Prove that the functions sin and cos, defined by 

“ (_l)n^2„+l 

(2n+l)! ’ 

“ (-l)”x2" 
n=o (2n)! 

are (well defined and) continuous on R. 

.15 Let / be the interval of convergence of the power series f{x) = 
Prove that / is continuous on I. 

.16 Prove that if ^ convergent series, then 



sinx = 



cosx = 
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(i) converges for all a; € (—1,1), and 



(ii) 



for each £ > 0 there exists 6 G (0, 1) such that if 1 — cr < a: < 1, 
then 



OO 



E 



an 



n—0 



OO 

a«a;” 

n—0 



< e. 



Thus lim 2 ._>i- ~ {Abel’s Limit Theorem). (For 

(ii), note that 



OO OO 

E “ E 


< 


N 

^«n(l-x") 


n—0 n—0 




n—0 



E 

n—N+1 



anx" 

n—N-^1 



for each N. Use Exercise (1.2.14: 15) to handle the last term on the 
right.) 

.17 Let convergent series with sums a, 6, respec- 

tively, and let 



Un — 0,ib„ + a2&n-l d" ’ ’ ’ d" an-1^2 d" Qn&l. 

Prove that if converges, then its sum is ab. (For — 1 < a; < 1 

set f{x) = ~ Then use Exercises 

(1.2.16:10), (1.2.16:3), and (1.4.5: 16).) 

This is the full form of Cauchy’s theorem on the multiplication of 
series, and should be compared with Exercise (1.2.16:3). 



Deeper results about continuity — indeed, many results in real-variable 
theory — depend on two fundamental properties of the real line, described 
in our next two theorems. 

By a cover of a subset S' of R we mean a family 14 of subsets of R such 
that S C IJ ZY; we then say that S is covered by U and that U covers S. If 
also each U G ZY is an open subset of R, we refer to U as an open cover of 
S (in R). By a subcover of a cover ZY of S we mean a family T <ZU that 
covers S; if also iF is a finite family, then it is called a finite subcover of U. 

Although there exist shorter proofs of the next theorem (see the next set 
of exercises), the one we present is adapted to prove a more general result 
(Theorem (3.3.9)) in Chapter 3. 

(1.4.6) The Heine— Borel—Lebesgue Theorem. Every open cover of 

a compact interval I in H contains a finite subcover of I. 
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Proof. Suppose there exists an open cover U of I that contains no finite 
subcover of I. Either the closed right half of / or the closed left half (or 
both) cannot be covered by a finite subfamily of U : otherwise each half, and 
therefore / itself, would be covered by a finite subfamily. Let Ii be a closed 
half of / that is not covered by a finite subfamily of U. In turn, at least 
one closed half, say I 2 , of Ii cannot be covered by a finite subfamily of 14. 
Carrying on in this way, we construct a nested sequence I D Ii D I 2 ■■ ■ 
of closed subintervals of I such that for each n, 

(a) |/„| =2-"|/| and 

(b) no finite subfamily of U covers 

By the nested intervals principle (1.2.6), there exists a point ^ G H^i 
Clearly ^ € I , so there exists U £14 such that ^ £ U. Since U is open, there 
exists r > 0 such that if |x — ^| < r, then x £ U. Using (a), we can find N 
such that if x G 1^, then |x — ^| < r and therefore x £ U; thus In C U. 
This contradicts (b). □ 

A real number a is called a limit point of a subset S' of R if each neigh- 
bourhood of a intersects S\{a}; or, equivalently, if for each £ > 0 there 
exists X £ S with 0 < |x — a| < £. By a limit point of a sequence (a„) we 
mean a limit point of the set {ai, 02 , . . .} of terms of the sequence. 

A nonempty subset A of R is said to have the Bolzano-Weierstrass 
property if each infinite subset S of A has a limit point belonging to A. 

(1.4.7) The Bolzano-Weierstrass Theorem. Every compact interval 
in R has the Bolzano-Weierstrass property. 

Proof. Let / be a compact interval, and S an infinite subset of I. By 
Corollary (1.2.8), any infinite sequence of distinct points of S contains a 
convergent subsequence; the limit of that subsequence is a limit point of S 
in the closed set I. □ 

(1.4.8) Exercises 

.1 Let AT be a subset of R with the Bolzano-Weierstrass property, and 
let (xn) be a sequence of points in X. Show that there exists a 
subsequence of (xn) that converges to a limit in X. (Note Lemma 
(1.2.7).) 

.2 Fill in the details of the following alternative proof of the Heine- 
Borel-Lebesgue Theorem. Let 14 be an open cover of the compact 
interval I = [a, b] , and define 

A = {x G / : [a,x] is covered by finitely many elements of 14} . 

Then A is nonempty (it contains a) and is bounded above; let f = 
sup A. Suppose that b, and derive a contradiction. 
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.3 Fill in the details of the following alternative proof of the Bolzano- 
Weierstrass Theorem. Suppose the theorem is false; so there exist a 
compact interval I and an infinite subset S' of / such that no limit 
point of S belongs to I. Construct a nested sequence I D Ii D I 2 D ■■ ■ 
of closed subintervals of / such that for each n, 

(a) |/„| = 2-"|/|, 

(b) S n is an infinite set, and 

(c) S n /„ has no limit points in 

Let 5 G show that ^ is a limit point of S. (This is 

one of the commonest proofs of the Bolzano- Weierstrass Theorem in 
textbooks.) 

.4 Here is a sketch of yet another proof of the Bolzano- Weierstrass The- 
orem for you to complete. Let / be a compact interval, and S an 
infinite subset of I; then the supremum of the set 

H = {x G / : S' n (— 00 , x) is finite or empty} 
is a limit point of S in /. 

.5 Let S be a subset of R with the Bolzano- Weierstrass property. Prove 
that S is closed and bounded. (For boundedness, use a proof by 
contradiction.) 

.6 Show that the Bolzano- Weierstrass Theorem can be proved as a con- 
sequence of the Heine-Borel-Lebesgue Theorem. (Let / be a compact 
interval in R, assume the Heine-Borel-Lebesgue Theorem (1.4.6), 
and suppose that there exists an infinite subset S' of / that has no 
limit point in /. First show that for each s G S there exists > 0 
such that S n (s — Tg, s -I- r^) = {s}.) 

.7 Let / be a real-valued function defined on an interval I. We say that / 
is uniformly continuous on / if to each e > 0 there corresponds <5 > 0 
such that \f{x) — f{x')\ < e whenever x,x' G I and \x — x'\ < 5. 
Show that a uniformly continuous function is continuous. Give an 
example of I and / such that / is continuous, but not uniformly 
continuous, on I . 

.8 Use the Heine-Borel-Lebesgue Theorem to prove the Uniform Con- 
tinuity Theorem: a continuous real-valued function / on a compact 
interval / C R is uniformly continuous. (For each e > 0 and each 
X G /, choose 5a; > 0 such that if x' G / and |x — x'| < 25a;, then 
|/(x) — /(x')| < e/2. The intervals (x — 5a;, x -I- 5a;) form an open 
cover of I.) 
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.9 Prove the Uniform Continuity Theorem (see the previous exercise) us- 
ing the Bolzano- Weierstrass Theorem. (If /:/—>■ R is not uniformly 
continuous, then there exists a > 0 with the following property: for 
each n £ there exist x„,yn £ I such that \xn — yn\ < ^/n and 
\f{Xn) - f{yn)\ > a.) 

The proof of the following result about boundedness of real-valued 
functions illustrates well the application of the Heine-Borel-Lebesgue 
Theorem. 

(1.4.9) Theorem. A continuous real-valued function f on a compact 
interval I is bounded; moreover, f attains its bounds in the sense that 
there exist points ^,rj of I such that f{f) = inf / and f{r]) = sup /. 

Proof. For each x £ I choose (5a, > 0 such that if x' £ I and \x — x'\ < Sx, 
then \ f{x) — f{x')\ < 1. The intervals (x — Sx,x + Sx), where x £ I, form 
an open cover of I. By Theorem (1.4.6), there exist finitely many points 
xi, . . . ,xn of I such that 



N 

/ C y (a;fc - Sxk , T ) ■ 



c= 1-f max{|/(a;i)|,...,|/(a:Ar)|}, 

and consider any point x £ I. Choosing k such that x £ (xk — Sx^, , Xk + Sx,, ) , 
we have 



|/(a;)| < lf(x) - f(xk)l + |/(a:fc)| 

< 1 + \f{xk)\ 

< c, 



so / is bounded on /. 

Now write 

m = inf /, M = sup /. 

Suppose that f{x) yf M, and therefore f{x) < M, for all x £ I. Then 
X I— >■ 1/(M — f{x)) is a continuous mapping of / into R’*’, by Proposition 
(1.4.4), and so, by the first part of this proof, has a supremum G > 0. For 
each X £ I we then have M — f{x) > 1/G and therefore f{x) < M — 1/G. 
This contradicts our choice of M as the supremum of /. □ 

(1.4.10) Exercises 

.1 Prove both parts of Theorem (1.4.9) using the Bolzano- Weierstrass 
Theorem and contradiction arguments. 
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.2 Let / be a continuous function on R such that f{x) — >■ oo as a: — >■ ±oo. 
Prove that there exists ^ G R such that f{x) > /(^) for all x G R. 

.3 Let / be a continuous function on R such that f{x) — >■ 0 as a; — >■ ±oo. 
Prove that / is both bounded and uniformly continuous. 

(1.4.11) The Intermediate Value Theorem. If f is a continuous 
real-valued function on an interval I, then f{I) has the intermediate value 
property (page 36). 

Proof. Let a, b be points of I, and y a real number such that /(a) < y < 
f{b); without loss of generality assume that a < b. Then 

S' = {a; G [a, b] : f{x) < y} 

is nonempty (it contains a) and bounded above by b, so f = sup S exists. 
Note that f < b and that (^,6] C I. We show that f{f) = y. To this 
end, suppose first that f{f) < y. Then, by Exercise (1.4.3: 7), there exists 
(5 G (0, b—f) such that if a: G / and |a; — ^| < 6, then f{x) < y; in particular, 
f{x) < y for all x £ (C; ■? + <^)) which contradicts the definition of f as the 
supremum of S. Thus f{f) > y. 

Now suppose that /(^) > y; then ^ > a. By another application of 
Exercise (1.4.3: 7), there exists 6' G (0,^ — a) such that if ^ — i5' < a; < ^, 
then f{x) > y. This is impossible, since, by the definition of “supremum”, 
there exist points x of (a, ^) arbitrarily close to f with f{x) < y. Hence 
f{f) < y, and therefore f{f) = y. □ 

(1.4.12) Corollary. Let f be a continuous real-valued function on a 
compact interval /, and let m = inf f,M = sup /. Then f{I) = [m, M], 

Proof. Use Theorems (1.4.9) and (1.4.11). □ 

(1.4.13) Exercises 

.1 Fill in the details of the following common proof of the Interme- 
diate Value Theorem. Let f{a) < y < f{b), write oq = a, bo = 
6, Co = |(ao + bo), and assume without loss of generality that a < b. 
If /(cq) = y, there is nothing to prove and we stop our construc- 
tion. Otherwise, by repeated interval-halving, we construct points 
0 - 0 , bo, Co, oi, 6i, Cl, . . . such that 

— either /(c„) = 0 for some n and the construction stops, 

— or else the construction proceeds ad infinitum, an < Un+i < 

bn-\-l 'Ll bn, f {^n) ^ y, f {bn) ^ y, Cn — 2 (^n 4“ ^n) , and 

0 < - a„ = (i)" {b- a). 
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Choosing x € show that f(x) = 0. 

.2 Use the Intermediate Value Theorem to prove that if & > 0 and n is 
an odd positive integer, then b has an nth root — that is, there exists 
r G R such that r" = b. (Of course, this result also follows from our 
definition of in Section 1; but it is instructive to see how it can be 
derived by other means, such as the Intermediate Value Theorem.) 

.3 Show that any polynomial equation 

X dn—lX “t“ * * * “t“ diX -j- dQ = 0 

of odd degree n, with coefficients Ofc G R, has at least one real 
solution. 

.4 What can you say about a function / that is continuous on [0, 1] and 
assumes only rational values? 

.5 Let /, g be continuous functions on [0, 1] such that f(x) G [0, 1] for all 
X, g(0) = 0, and (?(1) = 1. Show that f(x) = g(x) for some x G [0, 1]. 

.6 Prove that there is no continuous function / : R — >■ R that assumes 
each real value exactly twice. 

.7 Let / be continuous and one-one on an interval I; then /(/) is an 
interval, by Corollary (1.4.12). Prove that 

(i) either / is strictly increasing on / or else / is strictly decreasing 
on /; 

(ii) if a G 1°, then /(a) G f(I)°; 

(iii) f~^ is continuous on /(/). 

(For (iii), show that / is sequentially continuous at each point of 
/(/). You will need Corollary (1.2.8), Exercise (1.4.5: 6), and Exercise 
(1.2.9: 4).) 



Although the Intermediate Value Theorem has many applications, espe- 
cially in the solution of equations, none of its proofs provides an algorithm 
for constructing the point x with f{x) = y. This claim may come as a 
surprise: for is not the interval-halving proof in Exercise (1.4.13: 1) algo- 
rithmic? Alas, it is not: for, as any good computer scientist knows, there 
is no algorithm that enables us to decide, for given real numbers y and z, 
whether y = z or y ^ z. (For further discussion of these matters, see the 
Prolog of [5], and pages 65-66 of [8].) 
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In this section we cover the fundamentals of the differential and integral 
calculus of functions of one real variable. We do so rapidly, leaving many 
details to the exercises, on the assumption that you will have seen much of 
the material in elementary calculus courses. 

Let I be an interval in R, xg a point of I, and / a real- valued function 
whose domain includes I. We say that / is 



• differentiable on the left at xg if its left-hand derivative at xg, 



f'(xg ) 



lim 



fix) - fjxg) 

X — Xq 



lim 

h—¥0~ 



fjxg -\-h) - f{xg) 
h 



exists; 



• differentiable on the right at xg if its right-hand derivative at xg, 



/'Co) 



lim 



fix) - fjxg) 
X — Xg 



lim 

/1-S-0+ 



fjxg + h)~ fjxg) 

h 



exists; 



• differentiable at xg if xg is an interior point of I and the derivative of 

f at Xg, 



/'Co) 



exists. 



lim 

X—¥Xq 



fix) - fjxg) 
X — Xo 



fjxg + h)~ fjxg) 
h->0 h 



It follows from Exercise (1.4.3: 2) that / is differentiable at an interior point 
Xg of its domain if and only if f'ixff) and f'ix^) exist and are equal, in 
which case their common value is f'ixg). 

We say that / is differentiable on the interval I if it is 

— differentiable at each interior point of I, 

— differentiable on the right at the left endpoint of / if that point belongs 
to /, and 

— differentiable on the left at the right endpoint of I if that point belongs 
to I. 



Higher-order derivatives of / are defined inductively, as follows. 

/^°^ = /, 

/^'^ = /', 

/( 2 ) ^ fO ^ 

/( 3 ) ^ yn ^ 

/(”+!) = (/(”))' (n>3). 
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If the nth derivative exists, then / is said to be n— times dif- 
ferentiable at X] if exists for each positive integer n, then / is 

said to be infinitely differentiable at x. Definitions of notions such as nth 
right-hand derivative, n— times differentiable on an interval, and infinitely 
differentiable on an interval are formulated analogously. 

(1.5.1) Exercises 

.1 Prove that if / is differentiable at xg, then it is continuous at xg. Give 
an example of a function / : R — >■ R such that /'(O ) and /'(0+) 
both exist but / is not continuous at 0. 

.2 For each x G R write 

p(x, Z) = inf {\x — n| : n G Z} 



and 



OO 



fix) = Y, 

n—Q 



p(10”x,Z) 



Prove that / is continuous, but nowhere differentiable, on R. (For 
continuity use Exercise (1.4.5:10). To show that / is not differen- 
tiable at X, it is enough to take 0 < x < 1. Let 0.difi2 ... be a deci- 
mal expansion of x, the terminating expansion if there is one. Define 
hk to be —10“^ if Ofc = 4 or 9, and 10“* otherwise, and consider 
ifix + hk) - fix)) .) 

This example is due to van der Waerden [54]. Weierstrass, in a lecture 
to the Berlin Academy in 1872, gave the first example of a continuous, 
nowhere differentiable function: namely. 



fix) = g" cos(6"7rx), 

n=l 



where 0 < a < 1, 5 is an odd positive integer, and ab > l-\- 37t/ 2; for 
a discussion of a special case of Weierstrass’s example, see [28] , pages 



38-41. 



.3 Prove that if / is differentiable at x, then 



fix) 



lim 

h,k^0+ 



fjx-\-h) - /(x 
h-\- k 



k) 



Give an example of a function / where lim/i_>o Hfih) — f{—h)) /2h) 
exists but / is not differentiable at 0. 

.4 Let fix) = x”, where n is an integer. Using the definition of 
“differentiable”, prove that fix) = nx"“^ for all x G R. 
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.5 Let / and g be differentiable at a. Prove that f + g, f — g, cf {c £ H), 
and fg are differentiable at a, and that 

(/ + ff)'(a) = f{a) + g'{a), 

(/-5)'(a) = f{a)-g'{a), 

(c/)' (a) = c/'(a), 

(/ff)'(a) = f{a)g'{a) + f'{a)g{a). 



.6 Under the conditions of the last exercise, suppose also that g{a) ^ 0. 
Give two proofs that f /g is differentiable at a, and that 

f'{a)g{a) - f{a)g'{a) 

g{aY 

.7 Using the exponential series, prove that exp'(O) = 1. Hence prove 
that exp'(x) = exp(a:) for all x G R. 




Our next proposition, the Chain Rule, is possibly the most troublesome 
result of elementary calculus. 

(1.5.2) Proposition. If f is differentiable at a, and g is differentiable 
at f{a), then g o f is differentiable at a, and 

{g°fy (a) = g' if (a)) ■ f{a). 



Proof. Setting b = f{a), define 



h{u) 



g{u) - g{b) 
u — b 



if M yf 6 



g'{b) ifu = b. 



For all X yf a in some neighbourhood of a we have 



gjfjx)) - g{f{a)) 

X — a 



{h o f) (x) 



/(x) - /(g) 
X — a 



( 1 ) 



(Note that in verifying this identity, we must consider the possibility that 
/(x) = f{a).) Since g is differentiable at 6, h is continuous at b. Moreover, 
/ is differentiable, and therefore (by Exercise (1.5.1: 1)) continuous, at a; 
so o / is continuous at a, by Exercise (1.4.5: 5). Hence 



lim {{ho f) (x) 



/(x) - f{a) 



lim {h o /) (x) • lim 

x—¥a x—¥a 



/(x) - /(a) 



{hof) (a) • f'{a) 
g{b)f{a). 



X — a 



X — a 
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The result now follows immediately from (1). □ 

(1.5.3) Proposition. Let I be an open interval, f a one-one continuous 
function on I, and a € I, such that f'{a) exists and is nonzero. Then the 
inverse function f~^ is differentiable at /(a), and 

{/-■)'(/(«)) = 

Proof. Note that J = /(/) is an interval, by Theorem (1.4.11) and 
Proposition (1.3.3); moreover, by Exercise (1.4.13:7), /(a) is an interior 
point of J and f~^ is continuous on J. Let (j/„) be any sequence in J\{f{a)} 
that converges to /(a), and write = f~^{yn)', then yf a and 

lim f~^{xn) = f~^{f{a)) = a. 

n—^oo 

Since / is one-one, it follows that 

fjxn) - f{a) ^ Q. 

Xn- a 



whence 



Vn - f{a) f{xn) - f{a) f'{a) 

The desired conclusion now follows from Exercise (1.4.3: 6). □ 

(1.5.4) Exercises 

.1 Prove that log'(x) = 1/x for each a: > 0. 

.2 Let f{x) = x'' , where r G R. Prove that f'{x) = (Note that 

x'" = exp(rloga;).) 

.3 Let / be a strictly increasing function on an interval I, and let a be 
a point of I such that f'{a~^) exists and is nonzero. Prove that the 
inverse function f~^ is differentiable on the right at /(a), and that 

(r')' (Hyr) - 

.4 Let / be continuous on the compact interval / = [a, b] and differ- 
entiable on (a,b). Prove that if C G (o, 6) and f{f) = inf/, then 
f'if) = 0. Hence prove that if f{a) = f{b), then there exists f G (a, b) 
such that /'(/) = 0 {Rolle’s Theorem). 
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5 Let / be continuous on the compact interval [a, b] and n-times differ- 

entiable on (a, b). Suppose that there exist n+1 distinct points x of 
(a, b) at which f{x) = 0. Show that = 0 for some x G (a, b). 

6 Use Rolle’s Theorem to prove the Mean Value Theorem: if / is con- 
tinuous on the compact interval [a, b] and differentiable on (a, b), then 
there exists ^ G (a, b) such that f{b) — /(a) = /'(^)(6 — a). 

7 Let / be differentiable on an interval I. Prove that 

(i) if f'{x) > 0 for all x G I, then / is increasing on I; 

(ii) if f'{x) > 0 for all x G I, then / is strictly increasing on I; 

(iii) if f'{x) = 0 for all x G I, then / is constant on I. 

8 Let / be differentiable on an interval I, with f'{x) yf 0 for all x G I. 

Prove that / is one-one, and that either f'{x) > 0 for all a; G / or 
else f'{x) < 0 for all x G I. 

9 Prove that if / is differentiable on an interval /, then the range of /' 
has the intermediate value property on I. (Let f'{xi) < y < f'{x 2 ), 
consider g{x) = f{x) — yx, and use the preceding exercise.) 

10 Prove Cauchy’s Mean Value Theorem: if f,g are continuous on [a, 6] 
and differentiable on (a,b), then there exists ^ G (a,b) such that 

{f{b) - f{a)) g'iO = {g{b) - g{a)) f'{0- 
(Consider the function x (f{b) — f{a)) g{x) — (g{b) — g{a)) f{x).) 

11 Let f,g be continuous on [a,b] and differentiable on (a,b), let xq G 
[a, b] , and suppose that 

(i) g'{x) yf 0 for all x ^ xq, 

(ii) f(xo) = g(xo) = 0, 

(iii) 1= lim (/'(a;) /g'(a;) ) exists. 

X->-Xq 

Prove that lima,_>a;p (f{x) /g{x)) = 1. {THopitaTs Rule. Use the pre- 
ceding exercise to show that if (a;„) is any sequence in [a, 6]\{a:o} 
that converges to xq, then /(x„)/g(x„) — >■ Z as n — >■ oo.) 

12 Let g be twice differentiable at 0, with ^(0) = g'(0) = 0. Find 
where / is defined by 



fix) = 



gjx) 

X 



if X yf 0, 



0 



if X = 0. 
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The following generalisation of the Mean Value Theorem is one of the 
most useful results of the differential calculus. Unfortunately, it seems 
to have no completely transparent, natural proof; all the proofs in the 
literature use some trick or other to obtain the desired conclusion. 



(1.5.5) Taylor’s Theorem. Let f be {N +l)-times differentiable on an 
interval /, and let a € I. Then for each x € I there exists ^ between a and 
X such that 



/(^) = E - «)” + - «)• 



n=0 



Proof. Fixing x G I, without loss of generality we take a < x. Consider 
the function g : [a, x] — >■ R defined by 

9{t) = f{x)-Y,L^(^^_tr. 



n—O 

Using Exercises (1.5.1: 4 and 5), we have 



9 



'(f) = -fit) - ^ 



, ix-tr-f^{x-tr-^ 

nl [n — 1): 






Nl 



(x-t)r 



Applying the Mean Value Theorem (Exercise (1.5.4: 6)), we obtain ^ G 
(a, x) such that 






Then, as g(x) = 0, 



V! 



9(a) = (a;-C) (a: -a). 



which is equivalent to the desired result. □ 
The expression 

n\ 



n—Q 



is called the Taylor polynomial of degree n at a, and 



N 



- E 



/(")(a) 



(x — a)^ 



n—0 
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is called the remainder term of order n in Taylor’s Theorem. The theorem, 
as stated, has the remainder term in the Cauchy form: 

(a; -a). 

The next corollary gives us an alternative form — the Lagrange form — of 
the remainder. 



(1.5.6) Corollary. Under the hypotheses of Taylor’s Theorem, for each 
X G I there exists t between a and x such that 



n=0 



n! 



{N+l)l 



Proof. Again we take a < x and use a trick. With g the function in- 
troduced in the preceding proof of Taylor’s Theorem, we apply Cauchy’s 
Mean Value Theorem (Exercise (1.5.4: 10)) to g and 1 {x — t)^~^^. Since 
g{x) = 0, this yields t £ (a,x) such that 

g(g) ^(.-tr 

(x-a)^+i (N+l)(x-t)^ ' 



Hence 



ff(a) 



f(^+i)(t) 

(A^+1)! 



(x 






from which the required conclusion follows. □ 



(1.5.7) Proposition. Let L he the interval of convergence of the power 
series series on L. Then f is 

differentiable, and f'{x) = each interior point of L. 

Proof. We first recall from Exercise (1.2.16:16) that the power series 
a„a;" and have the same radius of convergence!?. 

Given x £ L°, let 

r= i(|x|-ki?). 

Then converges absolutely, by Exercise (1.2.16: 10). Given 

£ > 0, choose N such that l®"l < e. If h ^ 0 and \x + h\ < 

r, then, using the Mean Value Theorem, for each n > fV -I- 1 we obtain 6*„ 
between 0 and h such that 

(x -k /i)” - x" 



h 



n(x-|-0„)” ^ 
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Hence 



E 

oo 

^ E 






n—N-\-l 



{x + h)" - ; 






< ^ n\an\\{x + 9nT ^-a:” 



n=N+l 



oo 

< ^ n|a„| (^|x + 6»„|”“^ + 



oo 



It follows that 



<2 ^ n|a„|r" 

n— A^+1 
< 2 £. 



/(x+k)-/(x) ^ 



-T.'' 



< 



N 

E' 



n—1 



{x + /l)” - ; 



h 



+ 



E 

n=N+l 



(x + /z)^ 
h 



— nx 



N 

n—1 






■2e. 



Now, there exists 5 > 0 such that if 0 < \h\ < S, then \x + h\ < r and 
|(x + h)"-x" 



N 



< l + ^|a„| £ (l<n<lV). 



n—1 



For such h we then have 

\f{x+h) - f{x) 



-E 



UQnX 



< 3 £. 



Since £ > 0 is arbitrary, the result follows. □ 
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(1.5.8) Exercises 



.1 Let / be infinitely differentiable on an interval /, and suppose that 
there exists M > 0 such that |/^"^(a;)| < M for all sufficiently large 
n and all x € I. Given a € I, prove that the Taylor expansion, or 
Taylor series, of f about a, 




n=0 



converges to f{x) for each x € I. 

.2 Find the Taylor expansion of exp(— about 0. For what values of 
X does this expansion converge to exp(— x^)? 

.3 Prove that /(x) = exp(x) defines the unique differentiable function 
such that /(O) = 1 and /'(x) = /(x) for all x G R. 

.4 Let R > 0, and let power series whose 

intervals of convergence include {—R, R). Suppose that = 

for all X G {—R, R)- Show that a„ = bn for all n. 

.5 Prove that sin'x = cosx and that cos'x = — sinx for each x G R. 
(See Exercise (1.4.5: 14) for the definition of sin and cos .) 

.6 Prove the trigonometric addition formulae: 



cos(a + 6) = cos a cos 6 — sin a sin &, 

sin(a + &) = sin a cos 6 + cos a sin &. 

(Define 

F{x) = (cos(x + 6) — cos X cos 6 + sinx sin 6)^ 

+ (sin(x + 6) — sin x cos b — cos x sin b)^ 



and consider F'{x).) 

.7 Prove that cos is a strictly decreasing function in the interval [0,2], 
and that there is a unique p such that p/2 G [0, 2] and cos (p/2) = 0. 
Using the addition formulae from the preceding exercise, prove also 
that cos(x + 2p) = cosx and sin(x + 2p) = sinx for all x G R. (Of 
course, the number p is more usually denoted by tt.) 

.8 Derive the binomial series 



(l + x)« 




X 



n 
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for — 1 < a; < 1, where (g) = 1 and for n > 1, 

/ q;\ a{a — l)(o; — 2) ■ ■ ■ {a — n + 1) 

\n J n! 

(First show that the series in question does converge for |a;| < 1. Then 
apply Taylor’s Theorem with the Lagrange form of the remainder 
when 0 < a: < 1, and with the Cauchy form when — 1 < a; < 0.) 

.9 Let p>2, l/p+ 1/q = 1, and 0 < c < 1. Prove that 

(1 + c )« + (1 - c )« - 2 (1 + > 0 . 

(Use the binomial series.) 

.10 Let x,y € R, and let p,q be positive numbers with \/p + \/ q = 1. 
Use the preceding exercise to prove that 

\x + y\'i+\x-y\^<2{\x\^ + \y\PY-^ 

if 1 < p < 2, and that 

\x + y\^+\x-y\^>2{\xf + \yfY~^ 



if p > 2. 

Although the Riemann integral is taught in elementary calculus courses, 
for the best part of a century, following the development of a more sophis- 
ticated integral by Lebesgue and others, it has had little practical value. 
Indeed, Dieudonne, in characteristically forthright mood ([13], page 142), 
claims that the Riemann integral “has at best the importance of a mildly 
interesting exercise in the general theory of measure and integration. Only 
the stubborn conservatism of academic tradition could freeze it into a reg- 
ular part of the curriculum, long after it had outlived its historical im- 
portance.” We believe, nevertheless, that it is worth presenting a rigorous 
development of the Riemann integral for both historical and paedagogical 
reasons; but we skip lightly over this material, leaving much of it to the 
exercises. We discuss the Lebesgue integral in Chapter 2. 

By a partition of a compact interval I = [o, b] we mean a finite sequence 
P = (xo, xi, . . . , Xn) of points of I such that 

a = Xq < Xi < ■ ■ ■ < Xn = b. 



The real number 

max {xi+i — Xi : 0 < i < n — 1} 

is called the mesh of the partition. Loosely, we identify P with the set 
{a;o, . • . , Xn}- A partition Q is called a refinement of P if P C Q. 
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Now let /:/—>■ R a bounded function, and for 0 < z < n — 1 define 



m*(/) = inf {f{x) : Xi <x < Xi+i} , 
= sup{/(a;) : Xi<x < Xi+i} . 



The real numbers 

n—1 

P) = ^ ^i(f) i^^+l - ^i) > 

i^O 

n—1 

P (/> = X ! (^*+1 “ 

i^O 

are called the lower sum and upper sum^ respectively, for / and P. Since 
(&- a)inf / < L(/,F) < U{f,P) < (6- a) sup/, 



the lower integral of /, 

rb 



f = sup {L{f, P) : P is a partition of [a, 6]} , 
J a 



and the upper integral of /, 

ch 



f / = inf {P(/, P) : P is a partition of [a, 6]} , 
J a 



exist. 



(1.5.9) Lemma. Let f : [a, 6] — >■ R be bounded, and let P, Q be partitions 
of[a,b]. Then L{f,P) <u[f,Q). 

Proof. Take P = (xq, • ■ • , a^„), and first consider the case where Q = 
P U {/} for some point / ^ P Choose k such that Xk < ^ < Xk+i, and 
write 



a = inf {f{x) ■■ Xk < X < , 

P = inf {f{x) :^<x< Xk+i} ■ 



Then mk{f) = min {a, b} , so 



n—1 



P(/,P) = ^m.(/)( 



z=0, 

i^k 



Xj+i - Xi) + mk{f){xk+i - Xk) 



n—1 



= ^ mi{f) {x^+l - Xi) + mfc(/)(^ 

z-O, 

i^k 



Xk) + mk{f){xk+i - i) 
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n— 1 

- X! + /^(^fc +1 “ 0 

i^O, 

i^k 

= L{f,Q). 

Next, if Q is a refinement of P, then Q = P U {^i, . . . , for some distinct 
points ^ P, so 

L{f,P)<L{f,PU{^^}) 

<L(/, PU{a,6}) 

< • • • 

<L{f,Q). 

Similar arguments show that U (/, Q) < U (/, P) when Q is a refinement of 
P. 

Now consider any two partitions P, Q of [a, b] . Since P U Q is a refinement 
of both P and Q, we have 

L{f, P) < L{f, PUQ)< U(f, PUQ)< U(f, Q). □ 



It follows from this lemma that, as we might have anticipated. 



f>b pb 

/< / /• 



We say that / is Riemann integrahle over I if its lower and upper integrals 
coincide, in which case we define the Riemann integral of / over I to be 




We also define 




when / is Riemann integrable over [a, b] . 



(1.5.10) Exercises 

.1 Let / : [a, 6] — >■ R be bounded. Prove that / is Riemann integrable 
if and only if for each e: > 0 there exists a partition P of [a, b] such 
that U{f,P)-L{f,P) <e. 

.2 Let f{x) = on [0,1], and for each positive integer n let P„ be 
the partition of [0,1] consisting of the points i/n (0 < i < n). By 
considering L{f, Pn) and U{f, P„), show that / is Riemann integrable 
and that Jq f = 1 /3 . 
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.3 Prove that for any n G N the function a; i— x” is Riemann integrable 
over [a, 6]. (Use the first exercise in this set.) 

.4 Prove that an increasing bounded function / : [a, 6] — >■ R is Riemann 
integrable. 

.5 Prove that a continuous function / : [a, 5] — >■ R is Riemann integrable. 
(Use the Uniform Continuity Theorem (Exercise (1.4.8: 8)).) 

.6 Define / : [0, 1] — >■ R by 



.7 



Prove that / is 
Define / : [0,1] 



f 0 




{ 1 if a; is rational 
0 if a; is irrational, 
not Riemann integrable. 

^ Rby 

if a; = 0 or a; is irrational 

if a; = I for relatively prime positive integers p, q. 



Given £ > 0, let to = 0; let fi, . . . ,tm-i be, in increasing order, the 
points of (0, 1) that have the form p/q where p, q are relatively prime 
positive integers with 0 < q < 2/e] and let = 1. Taking a;o = 0, 
construct inductively a partition P = (xq, xi, , X 2 m+i) of [0, 1] such 
that 

X2k ^ tfi <e X2k+1 ^ 2^^ ^k+l)-, 



X2k+1 — X2k < 



e 

2(m + 1) ’ 



and X 2 m+i = 1- Show that if X 2 fc_i < x < X 2 k, then /(x) < e/2] that 
U{f,P) < e] and hence that / is Riemann integrable. 



.8 Prove that the composition of two Riemann integrable functions need 
not be Riemann integrable. (Note the preceding two exercises.) 



Since, by Exercise (1.5.10:3), /(x) = x defines a Riemann integrable 
function over [a, b], the next result generalises Exercise (1.5.10: 5). It should 
also be compared with Exercise (1.5.10: 8). 

(1.5.11) Proposition. Let f : [a, b] ^ J be Riemann integrable, where 
J is a compact interval, and let g : J ^ H be continuous. Then g o f is 
Riemann integrable. 

Proof. Let 

K = sup{p(p) : y G J}, 




66 



1. Analysis on the Real Line 



which exists by Theorem (1.4.9), and write I = [a,6] . According to the 
Uniform Continuity Theorem (Exercise (1.4.8: 8)), for each £ > 0 there 
exists 6 such that 



0 < (5 < 



£ 

b-a+2K 



and such that 



\g{x)-g{y)\ < 



£ 

b-a+2K 



whenever x,y € J and \x — y\ < S. Choose a partition P = (xq, . . . , x„) of 
/ such that 



U{f,P)-L{f,P)<6^ 



(this is possible in view of Exercise (1.5.10: 1)), and write P = SUT, where 



S' = {z : Mi{f) - TOi(/) < (5} , 
T = {i : M,if) - m,{f) > 6} . 



If i € S and x,x' G [xi,Xi+i ] , then \f{x) — f{x') \ < S and so 



\g° fix) - go f{x')\ < 



b-a+2K 



Hence 



and therefore 



Miig of)- rriiig o /) < 



b-a+2K’ 



n—1 

V {Mi{g of)- m,ig o /)) {x^+l - Xi) < V(a;*+i - Xi) 

b- a+ 2K ^ 

i^S i—O 

e{b — a) 

^ b-a + 2K' 



On the other hand, 

^(xi+i - Xi) < ^ (Miif) - rriiif)) {xi+i - xt) 

i^T i^T 

<S-^iUif,P)-L{f,P)) 

<S, 



SO 



{Mi{g of)- rriiig ° /)) i^i+i ~ Xi) < 2K '^{xi+i - Xi) 

i&T ieT 

< 2KS 
2Ke 

^ b-a+2K' 
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It now follows that 

U{gof, P)-L{gof,P) 

= X! ° f) - mi{g o /)) {Xi+i - Xi) 

i&S 

+ X! ° f) - mi{g o /)) {xi+i - Xi) 

jgt 

eih—a) 2Ke 

^ b-a + 2K ^ b-a + 2K ~ 

Reference to Exercise (1.5.10: 1) completes the proof. □ 

Let / : [a, &] — >■ R be a bounded function, and P = (xg, ■ ■ ■ , Xn) a 
partition of [a, 6]. Any expression of the form 

n—1 

i=0 

where G [xi,Xi+i] for each i, is called a Riemann sum for / (relative to 

If / is Riemann integrable, geometric arguments like those presented in 
elementary calculus courses lead us to believe that if the partition P has 
small mesh, then the corresponding Riemann sums will closely approximate 
/. This expectation is fulfilled in the first of the next set of exercises. 

(1.5.12) Exercises 

.1 Prove that a bounded function / : [a, 6] — >■ R is Riemann integrable if 
and only if there exists a real number A with the following property. 
For each £ > 0 there exists <5 > 0 such that |S — A| < e whenever 

— P is a partition of [a, b] with mesh less than S, and 
— S is a Riemann sum for / relative to P. 

Prove that, in that case, A = /. 

.2 Let f,g be Riemann integrable functions on [a,b], and let A C R. 
Prove that f + g and A/ are Riemann integrable, and that 




(Use the preceding exercise.) 
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.3 Let / : [a, 6] — >■ R be bounded, and a < c < b. Prove that / is 
Riemann integrable over [a, b] if and only if it is Riemann integrable 
over both [a, c] and [c, b] , in which case 






/■ 



.4 Let / : [a, 6] — >■ R be Riemann integrable. Prove that /“*■ = / V 0, 



/ = (— /) V 0, and |/| are Riemann integrable, and that |/^ / 

I/I . Prove also that if \f{x)\ < M for all x G [a, b], then 



< 




< M\x — y\ 



for all x,y G [a, b]. 

.5 Let / and g be Riemann integrable functions on [a, b]. Give two proofs 
that the product function fg is Riemann integrable over [a, 6]. (For 
one proof, first take / > 0 and use Exercise (1.5.10: 1). For a second 
proof, note that fg=\ ((/ + - (/ - gf) •) 

.6 Let / be a nonvanishing Riemann integrable function on the compact 
interval / = [a, 6], and suppose that 1// is bounded on I. Show that 
1// is Riemann integrable over I. 

.7 Prove that if / : [a, 6] — >■ R is continuous and nonnegative, and 
/^ / = 0, then f{x) = 0 for all x G [a, b]. 



What makes the calculation of integrals feasible is the connection be- 
tween integration and differentiation. There are various expressions of this 
connection, each of which may lay claim to the historic title of Fundamental 
Theorem of Calculus. Here is one strong version of that theorem. 

(1.5.13) Theorem. If F is differentiable on [a, 6], and F' is Riemann 
integrable over [a,b], then 



f F' = F{b)-F{a). 
J a 



Proof. Let P = (a;o, ■ . ■ , x„) be any partition of [a, b] . By the Mean Value 
Theorem, for each i there exists fi G (xi,Xi+i) such that 



F{xi+i) - F{xi) = F'{f,^){xi+i - Xj). 
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Hence 



n— 1 n— 1 

i— 0 z— 0 

Since P is any partition of [a, 6], the result now follows from Exercise 
(1.5.12:1). □ 

Let F, f be two mappings of [a, b] into R. We say that F is a primitive, 
or antiderivative, of / on [a, b] if F'(x) = f{x) for all x G [a, b]. In view of 
the Fundamental Theorem of Calculus (1.5.13), there is an obvious strat- 
egy for calculating the Riemann integral of a function / : [a, b] — >■ R: first 
find a primitive F of f, and then compute F{b) — F (a). Of course, as any 
student of calculus quickly learns, finding primitives of Riemann integrable 
functions is often a (literally) tricky business; moreover, the class of Rie- 
mann integrable functions / for which there exist primitives expressible in 
terms of elementary functions is relatively small [41]. So the Fundamental 
Theorem of Calculus has severe practical limitations, which have led to the 
development of highly accurate, fast methods of numerical integration (see, 
for example, [26]). 

(1.5.14) Exercises 

.1 Let / be Riemann integrable over I = [a, 6], and define 

F{x) = f f {a < X <b). 

J a 

Prove that F is continuous on /. Prove also that if xq G / and 

lim /(x) = /(xo), 

X—^Xq , X^I 



then 



lim 



Xq, X^I 



F{x)-F{xq) 

X — Xq 



f{xo)- 



In particular, we obtain a result which is also sometimes called the 
Fundamental Theorem of Calculus: if / is continuous on [a, 6], then 
F is differentiable on [a, 6], F'(x) = /(x) for all x G (a, 6), F'^) = 
/(a), and F'{b~) = f{b). 



Let the power series /(x) = have radius of convergence 

R (which could be oo). Prove that for each x G {—R, R), 



/-E 



^x"+i. 



n—0 



n -I- 1 



.2 
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.3 By considering 

(1 + x)(l -x + x"^ -x^ + ■■■ + (-l)”a;”), 

show that 

- OO 

— = (-i<Ki). 

n =0 

Hence find power series expansions for log(l — a;) and (1 — log(l — 
x) on (—1, 1). Then show that the identity 

i (log(l - x)f = + \ {^ + \) + \ {^ + \ + \) ^ 

is valid for each a; G (—1, 1]. 

.4 In this exercise you will prove that and therefore tt itself, is 
irrational. Given a positive integer n, define 

<j,{x) = -x'^{i-xr. 

n\ 

Prove that 

(i) = 0 for fc < n or k > 2n; 

(ii) and (/)(*)(!) are integers for all fc G N. 

Suppose that = p/q for some positive integers p, q, and define 

n 

F = q”' y^(— 

fc =0 

Show that J^(0) and i^(l) are integers, and that 
F"{x) + 7t^F(x) = 7T^p”(/)(a;). 
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We end this chapter by sketching the development of a generalisation of 
the Riemann integral. To do so we must first introduce another important 
class of functions on intervals. 

In the rest of the section, unless we say otherwise, I = [a, b] is a compact 
interval, and / a mapping of / into R. For all x,y G I with x < y we define 
the variation of / over [x, y] to be 

{ n-l 

l/(^i+l) - f{Xi)\ '■ X = Xq < Xi < ‘ - < Xn = y 

i^O 

if this quantity exists as a real number; we then say that / has hounded 
variation on [x,y\. 

(1.5.15) Exercises 

.1 Let / have bounded variation on I . Prove that 

(i) / is bounded on I, 

(ii) Tf{a, b) = Tf{a,x) + Tf{x, b) for all x G I, and 

(iii) Tf{a, •) is an increasing function on I . 

.2 Prove that / has bounded variation on I if and only if there exist 
increasing functions g, ft. on / such that / = g — h. (For “only if” note 
part (iii) of the preceding exercise.) 

.3 Let f,g be functions of bounded variation on I. Prove that f + g, 
Xf (where A G R), and fg are of bounded variation, and that if 
inf {|/(x)| : X G 1} > 0, then 1// is of bounded variation. 

.4 Define / : [0, 1] — >■ R by 

{ x^ sin if 0 < X < 1 
0 if X = 0. 

Prove that / is differentiable at each point of [0, 1] but does not have 
bounded variation on [0, 1]. 

.5 Let /:/—>■ R have bounded variation. Prove that the one-sided 
limits f{x~) and /(x+) exist at each point of (o, ft), as do f{a~^) and 
f{b~), and that the set of points of / at which / is discontinuous is 
either empty or countable. (See Exercise (1.4.5: 8).) 

.6 Let (/n) be a sequence of functions of bounded variation on I such 
that /(x) = lim„_>oo fn{x) exists for each x G I. Prove that Tf{a, ft) < 
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liminf (a, 6). (First show that for any partition (xq, xi, . . . , x„) of 
I and any positive integer k, 

n— 1 n— 1 

\f{xi+i) - f{xi)\ < Tf^{a,b) + Y \f(xi+i) - fk{xi+i)\ 

i— 0 i— 0 

n—1 

i=0 



Given e > 0, then choose k appropriately.) 



Now let a : / — >■ R be a function with bounded variation on I, P = 
(xo, xi, . . . , Xn) a partition of I, and /:/—>■ R a bounded function. Any 
expression of the form 



n—1 

Y (<^(xi+i) - a(x*)) , 

i=0 

where € [xj,Xi+i] for each i, is called a Riemann-Stieltjes sum for / 
(relative to P and a). We say that / is Riemann-Stieltjes integrable (over 
/ with respect to a) if there exists a real number A with the following 
property. For each e > 0 there exists 5 > 0 such that |S — A| < £ whenever 

— P is a partition of I with mesh less than 6 and 

— S is a Riemann-Stieltjes sum for / relative to P. 



In that case, A — the Riemann-Stieltjes integral of f with respect to a — is 
the unique real number with this property, and is usually written 



/(x)da(x). 



Exercise (1.5.12: 1) shows that the Riemann integral is just the special 
case of the Riemann-Stieltjes integral in which a(x) = x. 



(1.5.16) Exercises 

In these exercises, a has bounded variation on /, and /, g are bounded 
real- valued functions on I. 

.1 Why, in the foregoing definitions, do we require the function a to be 
of bounded variation? 
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.2 Let a be an increasing function on / = [a, b]. Given a partition P = 
(xo, xi, . . . , Xn) of /, we call the real numbers 






2 = 0 
n—l 



U{f,P,a) = (a(xi+i) - a{xi)) 



2 = 0 



respectively, the lower sum and the upper sum for / relative to P 
and a. Prove that the lower integral 

/ f{x) da(cc) = sup {L(/, P,a) \ P is a partition of /} 

J a 

and the upper integral 

f f{x) da(a;) = inf {U{f, P,a) : P is a partition of /} 

J a 

of / with respect to a exist. Prove also that / is Riemann-Stieltjes 
integrable with respect to a if and only if 



b pb 

f{x)da{x)= / f{x)da{x), 
J a 



in which case their common value is f{x) da{x). 

.3 Prove that if / is continuous, then it is Riemann-Stieltjes integrable 
with respect to a, and 



f{x) da(a;) 



< [ dTa{a,x) < MTa{a,b), 

J a 



where Ta{a,x) is the variation of a on the interval [a,a;], and M = 
sup{|/(x)| : a < X < b} . (Note that To;(a, •) has bounded variation, 
by Exercises (1.5.15: 1 and 2).) 

.4 Prove that if /, g are Riemann-Stieltjes integrable with respect to a, 
then so are f + g, f — g, and A/ (where A G R); in which case, 

/ b pb ph 

{f{x)+g{x))da{x) = / f{x)da{x)+ g{x)da{x), 

a J a J a 

/ b pb pb 

{f{x)-g{x))da{x) = / f{x)da{x)~ g{x)da{x), 

a J a J a 



and 



\f{x)da{x) = \ j f{x)da{x). 
J a 




74 



1. Analysis on the Real Line 



.5 Let a < c< b. Prove that / is Riemann-Stieltjes integrable over [o, b] 
(with respect to a) if and only if it is Riemann-Stieltjes integrable 
over both [a,c] and [c,b]; in which case, 



pb pc pb 

/ f{x)da{x)= / f{x)da{x)+ / f{x)da{x). 

J a J a J c 



.6 Prove that if a has a continuous derivative on I, then the Riemann- 
Stieltjes integral f{x) da(a;) exists and equals the Riemann integral 

.7 Let a,f3 be of bounded variation on I, and suppose that / is 
Riemann-Stieltjes integrable with respect to both a. and l3. Prove 
that / is Riemann-Stieltjes integrable with respect to a + (}, and 
that 




f{x)d{a + !3){x) 



f{x) da(a;) 




f{x) dP{x). 



Prove also that for each A € R, / is Riemann-Stieltjes integrable 
with respect to Aa, and 



/(x)d(Aa)(a:) = A / f{x)da{x). 



The next lemma enables us to discuss the continuity of the function 



(1.5.17) Lemma. Let f have hounded variation on I. Then for each 
positive integer n there exists a function : / — >■ R such that 

(i) \f{x+) - f{x) \ = \gn(x+) - gn{x)\ whenever a < x < b, 

(ii) \f{x) - f(x~)\ = \gn{x) - gn{x~)\ whenever a < x < b, 

(iii) Tf{a,-) — gn is an increasing function, and 

(iv) 0 < Tf {a, x) — g„{x) < 1/n for all x € I. 

Moreover, f and gn are continuous at precisely the same points of I. 

Proof. We may assume that a < b. Referring to Exercise (1.5.15:5), 
choose points a = xq < x\ <■■■ < Xm-i < Xm = b such that / is 
continuous &i Xi (l<t<m— 1), and 

m— 1 

^ |/(a;*+i) - f{xi)\ > Tf{a, h) - i. 

7 = 0 
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Setting gn{a) = 0, construct the function g„ on the intervals [xi,Xi+i] 
inductively, as follows. Assume that gn{x) has been defined for a < a; < 
where i < m, and consider x with Xi < x < Xi+i. If f{xi+i) — f{xi) > 0, 
set 

9n{x) = f{x) + gn{xi) - f{x^); 

if f{xi+i) - f{xi) < 0, set 

gn{x) = -f{x) + gn{Xi) + /(Xj). 

This completes the inductive construction. 

If Xi < X < x' < Xi+i, then (see Exercise (1.5. 15: 1)) 

Tf{a, x') — Tf{a, x) = Tf{x, x') 

>\f{x')-f{x)\ 

= \9ni.x') - g„(x)| 

> 9n{x') - 5„(x). 



Thus if Xi_i < X < Xi < ■ ■ ■ < Xj < x' < Xj+i, then 
Tf{a, x') - Tf{a, x) = T/(a, x') - T/(a, Xj) 

k—i 

+T/(o, Xj) - Tf{a,x) 
a-i 

> ffn(x') - 9n{Xj) + ^ i9n{Xk+l) ~ 9n{Xk)) 
k—i 

+9n{Xi) - gn{x) 

= 9n{x') - gn{x). 

It follows that Tf{a, •) — is an increasing function. For all x G [a, b] we 
now have 



0 = Tf{a,a) - gn{a) 

< Tf{a,x) - 5„(x) 

< Tf{a,b) - gn{b) 



~ ~ ^ ^ (,9n{Xi+l) — 9n{Xi)) 

i=0 
n—1 

= Tf{a, b)~Y^ |/(xi+i) - f{x,)\ 
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Finally, properties (i) and (ii) hold, since on each interval [xi,Xi+\\ either 
9 n — f or g„ + f is constant; the last part of the statement of the lemma 
follows immediately. □ 

(1.5.18) Proposition. Let f have hounded variation on I. Then 
Tf{a,x+) - Tf{a,x) = |/(x+) - f{x)\ 

if a < X < b, and 

Tf{a,x) - Tf{a,x~) = |/(x) - f(x~)\ 

if a < X < b. Hence f and Tf{a,-) are continuous at precisely the same 
points of I . 

Proof. For each positive integer n choose as in the preceding lemma. 
For a < X < x' < b we have 

\Tf{a,x') - Tf{a,x) - gn{x') + gn{x)\ 

< \Tf{a,x') - gn{x')\ + \Tf{a,x) - gn{x)\ 




Letting x' approach cc, we see that 

\Tf{a,x~'~) - Tf{a,x) - 5 „(x+) + g„(a:)| < |. 

Now noting that 

9 n{x~'') - 9n{x) = |/(x+) - f{x) \ , 

we obtain 

Tf{a,x~'~) -Tf{a,x) = lim (g„{x~^) - 9 n{x)) = \f{x+) - f{x)\ . 

The rest of the proof is left as an exercise. □ 

The preceding proposition enables us to prove a result that is used later 
(in Theorem (6.1.18)) to establish the uniqueness of the representation of 
certain continuous linear functions. 

(1.5.19) Proposition. Let a he a function of hounded variation on L = 
[a, b], and let D he the set consisting of a, b, and all points of (a, b) at which 
a is discontinuous. Then f f{x) da(a;) = 0 for each continuous function 
/:/—>■ R z/ and only if a{x) = a{a) for all x € I\D. 

Proof. Note that D is countable, by Exercise (1.5.15:5). Suppose first 
that f{x) da(x) = 0 for each continuous / on /, and consider any point 
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^ € I\D. There exist arbitrarily small positive numbers t G I\D such that 
^ < b — t. For such t let / be the continuous function that equals 1 on the 
interval [a, .C]) equals 0 on [^ + t, b], and is linear on ^ + t]. Referring to 
Exercises (1.5.16: 5 and 3), we obtain the estimate 



0 = / f{x) da(x) 






= J f{x)da{x)+J f{x)da{x) 
< a{^) — a{a) + Tq, (^, ^ + t) + 0. 




f{x)da{x) 



Letting t tend to 0 and using Proposition (1.5.18), we see that a(^) = a{a). 

Now suppose, conversely, that a{x) = a{a) for all x G I\D, and let / 
be any continuous real-valued function on I. Given e > 0, choose i5 > 0 
as in the definition of f(x) do!(a;). In view of Exercise (1.5.15: 5), we can 
choose a partition P, with mesh less than 6, consisting of a, b, and points 
of I\D. For any Riemann-Stieltjes sum S for / corresponding to P and a, 
we then have S = 0 and therefore 



f{x) da(x) 



S — / f{x) da{x) 

J a 



< S. 



Since e > 0 is arbitrary, it follows that f{x) da(a;) =0. □ 

(1.5.20) Exercises 

.1 Complete the proof of Proposition (1.5.18). 

.2 Let a be of bounded variation on I = [a, h]. Prove that f{x) da{x) = 
0 for each continuous /:/—>■ R if and only if a(x) = a{a) for all x 
in a dense subset of / that includes b. 



We bring our treatment of the Riemann and Riemann-Stieltjes integrals 
to an end here. In the next chapter we develop a type of integral, based 
on a generalisation of the integral as an antiderivative, that is much more 
powerful than the Riemann integral, and for which it is possible to construct 
(although we do not do so) a related generalisation analogous to that of 
Riemann-Stieltj es . 

For more on Riemann and Riemann-Stieltjes integration see [17], [42], 
or [50]. 




2 



Differentiation and the Lebesgne 
Integral 



More matter with less art. 
HAMLET, Act 2, Scene 2 

In the first section of this chapter we show how the ideas of Chapter 1 can 
be applied in a theory of the length of a subset of R; this leads to the Vitali 
Covering Theorem, a result with many interesting applications in the theory of 
differentiation and integration. Building on that material, in the next two sections 
we describe F. Riesz’s development of Lebesgue integration as the inverse process 
to differentiation “almost everywhere”. 



2.1 Outer Measure and Vitali’s Covering Theorem 

Can we assign to a subset A of R a measure of its length? We have already 
done this when A is a bounded interval; but what about a more general 
set A7 The answer lies in measure theory, a subject that was pioneered by 
Lebesgue, Borel, and others at the beginning of this century and which has 
proved of immense importance in analysis, probability theory, and many 
other areas of mathematics. 

The outer measure of A is the quantity 

ti*{A) = inf AIn\- (^n)“=i is a cover of A 

by bounded open intervals} , 
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which we take as c» if the set on the right-hand side is unbounded.^ If 
G R, we say that A has finite outer measure. Note that since, for 
any sequence (/„) of bounded open intervals that covers A, the terms of the 
series positive, the (possibly infinite) sum of the series does 

not depend on the order of those terms; this is an immediate consequence 
of Exercise (1.2.17: 1). 

If A has outer measure zero, then we say that Gl is a set of measure zero, 
or that A has measure zero. Thus A has measure zero if and only if for 
each e > 0 there exists a sequence (/n))^i of bounded open intervals such 
that A C U“=i and l^«l < £• 

(2.1.1) Exercises 

.1 Show that for each A C R, ia*{A) is the infimum of \^n\ 

taken over all covers of A by sequences (/n)))°=i of bounded, but not 
necessarily open, intervals. 

.2 Prove that if a subset A of R has finite outer measure, then for each 
£ > 0 there exists a sequence (/„) of disjoint bounded open intervals 
such that A C U^i l^"l < M*(^) +^- (Use Proposition 

(1.3.6).) 

.3 Show that = 0, and that if Ac B, then n*{A) < n*{B). 

.4 Prove that for each a G R, /x*({a}) = 0. 

.5 Let A be a subset of R, and E C A a set of measure zero. Show that 
^^*{A\E)=^,*{A). 

.6 Let A be a subset of a compact interval I. Prove that fJ,*{A) + 
H*{I\A) > |/| . (It follows from results towards the end of Section 3 of 
this chapter that, perhaps surprisingly, we cannot replace inequality 
by equality in this result.) 

.7 Let (An) be a sequence of subsets of R. Show that 

( oo \ oo 

U A„ <J2ti*{An), 

n=l / 

where the right-hand side is taken as oo if either any of its terms is oo 
or the series diverges. (If one of the sets An has infinite outer measure, 
then the inequality is trivial. If each An has finite outer measure, then 
for each positive integer n and each £ > 0 there exists a sequence 



^In Section 1 of Chapter 3 we give a precise meaning to this use of oo as an 
“extended real number”. 
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{In fc)^i of bounded open intervals such that A„ C IJ^i In k and 
Etl\In,k\<t^*{^n) + 2-^e.) 

Prove that if also the sets An are pairwise-disjoint, then 

( oo \ oo 

n=l / n=l 

.8 Give two proofs that a countable subset of R has measure zero. Hence 
prove that R is uncountable. 

.9 Give two proofs that the union of a sequence of sets of measure zero 
has measure zero. 

.10 Prove that a subset H of R has finite outer measure if and only if 
I = lim„_>oo fk*{A n [—n, n]) exists, in which case H*{A) = 1 . 

.11 Prove that /r* is translation invariant — that is, /i*(H + t) = fJ,*{A) 
for each H C R and each t G R, where A + t = {x + t:xG A}. 

(2.1.2) Proposition. The outer measure of any interval in R equals the 
length of the interval. 

Proof. Gonsider, to begin with, a bounded closed interval [a, h]. For each 
£ > 0 we have [a, h] C {a — e,b + e) and therefore 

fk*{[a, b]) < |(a — e, & + e)| = b — a + 2e. 

Since e > 0 is arbitrary, we conclude that 6]) < b — a. To prove the 

reverse inequality, let (/„) be any sequence of bounded open intervals that 
covers [a, b]. Applying the Heine-Borel-Lebesgue Theorem (1.4.6), and re- 
indexing the terms /„ (which we can do without loss of generality) , we may 
assume that for some N, 



[a,b] C IiUhU-'-UlN- 

There exists an interval , where 1 < fci < A^, that contains a; let this 
interval be (oi, 6i). Either & < &i, in which case we stop the procedure, or 
else bi < b. In the latter case, bi G [a, 6]\(ai, &i); so there exists an interval 
Ik2, where 1 < k2 < N and 7^ ki, that contains bi; call this interval 

(02,62)- Repeating this argument, we obtain intervals (oi,6i), (02,62),... 
in the collection {Ii, . . . , 1 ^} such that for each i, Ui < 6i_i < bi. This 
procedure must terminate with the construction of (o^, bj) for some j < N. 
Then 6 G (o^ , bj), so 

N j 

\In\ > ^(6* -Oi) 
n—1 i—1 
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= bj - (aj - bj-i) - {aj-i - bj-2) 

(o2 - 61) - ai 

> bj — ai- 

It follows that l-^nl > b — a and therefore, since (/„) was any sequence 

of bounded open intervals covering [a, 6], that &]) > b — a. Coupled 

with the reverse inequality already established, this proves that 6]) = 
b—a. 

The proof for other types of interval is left as the next exercise. □ 

(2.1.3) Exercises 

.1 Complete the proof of Proposition (2.1.2) in the remaining cases. 

.2 Let {/i, . . . ,In} be a finite set of bounded open intervals covering 
Q n [0, 1]. Prove that J2n=i \^n\ > 1. (Given e > 0, extend each J„, if 
necessary, to ensure that it has rational endpoints and that the total 
length of the intervals is increased by at most e. Then argue as in the 
proof of Proposition (2.1.2).) 

.3 Let X be a subset of R with finite outer measure. Prove that for each 
e > 0 there exists an open set A D X with finite outer measure, such 
that +s. (Use Exercise (2.1.1: 2).) Show that if X is 

also bounded, then we can choose A to be bounded. 

Let X be a subset of R, and V a family of nondegenerate intervals — that 
is, intervals each having positive length. We say that V is a Vitali covering 
of X if for each £ > 0 and each x G X there exists I G V such that x G I 
and |/| < £. 

(2.1.4) The Vitali Covering Theorem. Let V be a Vitali covering of 
a set X C R with finite outer measure. Then for each £ > 0 there exists a 
finite set {/i, . . . ,In} of pairwise-disjoint intervals in V such that 

/i* |^X\ U < £. 

We postpone the proof of this very useful theorem until we have dealt 
with some auxiliary exercises. 

(2.1.5) Exercises 

.1 Let V be a Vitali covering of a subset X of R, a; a point of X, and 
A an open subset of R containing X. Show that for each £ > 0 there 
exists I GV such that x G I, I C A, and |/| < £. 




2.1 Outer Measure and Vitali’s Covering Theorem 83 



.2 Let Ii, . . . ,In be finitely many closed intervals belonging to a Vitali 
covering V,of a subset X of R with finite outer measure, and let x G 
In- Show that for each e > 0 there exists / G V such that 
X £ I, |/| < £T, and / is disjoint from In- 



Proof of the Vitali Covering Theorem. If necessary replacing the 
intervals in I by their closures, we may assume that V consists of closed 
intervals. Referring to Exercise (2.1.3: 3), choose an open set Ad X with 
finite outer measure. In view of Exercise (2.1.5: 1), we may assume without 
loss of generality that 



/ C R for each 7 G V. (1) 

Choosing any interval 7i in the covering V, we construct pairwise-disjoint 
intervals Ii,l 2 , ... in V inductively as follows. Assume that we have con- 
structed 7i, ...,/„ in V. If A C Ufc=i Ik^ then /x* (A\1J^^^ Ik) = 0 and 
we stop the construction. If X is not contained in Ik: then Exercise 

(2.1.5: 2) shows that the set 



S^=l\I\:I €V, /n|j7fc = 0 

I fc=i 

is nonempty. Since, by (1), is bounded above by it follows that 

Sn = sup Sn 

exists; moreover, as each 7 G V is nondegenerate, s„ > 0. To complete our 
inductive construction, we now choose In+i G V such that In+i H Ufc=i Ik = 
0 and |7,^_i_]^| ^ 

We may assume that this construction leads to an infinite sequence 
(7„)^i of pairwise-disjoint elements of V. Since the partial sums of the 
series l^nl bounded by fj,*{A), the monotone sequence principle 

(Proposition (1.2.4)) ensures that the series converges. Given e > 0, we can 
therefore find N such that 



n—N-\-l 

For each n > V let be the midpoint of 7„, and let J„ be the closed 
interval with midpoint x„ and length 5 |7„| . It suffices to prove that 

N OO 

X\\JlnC IJ Jn- 

n—1 n—N-\-l 



( 2 ) 
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For then 

( N \ oo oo 

A U p E = 5 E 

n— 1 / n=N-\-l n—N-\-l 

To prove (2), consider any x G Exercise (2.1.5: 2), there 

exists / G V such that x £ I and I fl Un=i = 0- We claim that I (1 Im 
is nonempty for some m > TV. If this were not the case, then for each m 
we would have / fl U™=i = 0 and therefore |/| < Sm < “^ l^m+i| i since 
limm_>oo \Im\ = 0 (by Exercise (1.2.14: 1)), it would follow that |/| = 0, 
which is absurd as V contains only nondegenerate intervals. Thus 

v = min{m > N : I C\ Im 

is well defined, I fl Un=i = 9, and therefore |/| < Si/-i < 2 |/jy| . Since 
X £ I and / n /jy 0, we see that 

\x — Xi/\ < |/| + 5 |/jy| < 2 \Ii/\ + i \Ii/\ = I \Ii/\ . 

Hence x £ Jy. This establishes (2) and completes the proof. □ 

In the remainder of this section we apply the Vital! Covering Theorem 
in the proofs of some fundamental results in the theory of differentiation 
and integration. 

Let I be an interval in R. We say that a mapping /:/—>■ R is ab- 
solutely continuous if for each £ > 0 there exists i5 > 0 such that if 
([o/c, ^fe])fc=iis a finite family of nonoverlapping^ compact subintervals of 
I such that YJk=i - flfe) < S, then YIk=i l/(^fe) “ /(“fe)! < £• 

(2.1.6) Exercises 

.1 Prove that an absolutely continuous function on / is both uniformly 
continuous and bounded. 

.2 Let /, g be absolutely continuous functions on I . Prove that the func- 
tions f-\-g, f—g, A/ (where A G R), and fg are absolutely continuous, 
and that if inf {|/(a;)| : a; G /} > 0, then 1// is absolutely continuous. 

.3 Prove that if / is differentiable, with bounded derivative, on an 
interval /, then / is absolutely continuous. 

.4 Let / be absolutely continuous on a compact interval / = [a, b]. Prove 
that / has bounded variation in I, that the variation function Tf{a, •) 
is absolutely continuous on /, and that / is the difference of two abso- 
lutely continuous, increasing functions on I. (See Exercises (1.5.15: 1 
and 2).) 



^Two intervals in R are nonoverlapping if their intersection is either empty 
or contains only endpoints of the intervals. 
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Let S' be a subset of R, and P{x) a statement about real numbers x. If 
there exists a set E of measure zero such that P{x) holds for all x in S\E, 
then we say that P{x) holds almost everywhere on S, or, more loosely, that 
P holds almost everywhere on S; in the case S = R we say simply that 
P{x), or P, holds almost everywhere. 

A simple corollary of the Mean Value Theorem (Exercise (1.5.4: 6)), one 
that suffices for many applications, states that if / is continuous on [a, b] 
and \f'{x)\ < M for all x G (a, &), then |/(6) — /(a)| < M{b — a). Our first 
application of the Vitali Covering Theorem generalises this corollary, and 
can be regarded an extension of the Mean Value Theorem itself. 

(2.1.7) Proposition. Let f be an absolutely continuous mapping of 
a compact interval I = [a, b] into R, and F a differentiable increasing 
mapping of I into R such that |/^(x)| < F'{x) almost everywhere on I. 
Then 

\f{b)-f{a)\<F{b)-F{a). (3) 

Proof. Let E C / be a set of measure zero such that \ f'{x)\ < F'{x) for 
each X G X = I\E. We may assume without loss of generality that a,b G E. 
Given e > 0, choose (5 > 0 as in the definition of absolute continuity. For 
each X G X there exist arbitrarily small r > 0 such that [x, x + r] C (a, b), 

\f{x + r) - f{x) - f(x)r\ < sr, 

\F{x + r) — F{x) — F'{x)r\ < er, 



and therefore 



\f{x + r) - f{x)\ < \f'{x)\r + er 

< F'{x)r + er 

< F{x + r) — F{x) + 2er. 



The sets of the form [x, x + r] , for such r > 0, form a Vitali covering of 
X. By the Vitali Covering Theorem, there exists a finite, pairwise-disjoint 
collection {[xk,Xk + Tk\)k=i of sets of this type such that 



N 



X\ U [Xk, Xk + ?’fc] < s. 



fc=i 



We may assume that Xk + Vk < Xk+i for 1 < fc < iV — 1. Thus 



N-l 

xi - a + ^ (xfc+i - Xk - rk) + b - xn - rjM < S, 
k=l 
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and therefore 



N-l 



fc=l 



|/(xi) - f{a)\ + \f{xk+i) - f{xk + r-fc)| + \f{b) - /{xn + rN)\< s. 
It follows that 

1 /( 6 ) - f{a)\ < \f{xi) - f{a)\ + Y \fi^k+i) - f{xk + rk)\ 



N-l 



k^l 



N 



■ \f{b) - f{xN + fAf)| + ^ \f{xk + Tk) - f{Xk)\ 



k=l 



N 



< 



< 



s + Y + "^k) - F{xk) + 2erk) 



fc=i 



Af-l 



£ + F{xi) - F{a) + Y {F{xk+i) - F{xk + Xk)) 



k=l 



N 



+ Y i^i^k + Tk) - F{xk) + 2erfc) 



k=l 



+ (F(6) - F{xn + tn)) 



N 



= e + F{b) - F{a)+2eY 



Xk 



fc=i 



< F(6) - F{a) + e(l + 26 - 2a). 

Since e > 0 is arbitrary, we conclude that (3) holds. □ 



(2.1.8) Exercises 

.1 Let / be absolutely continuous on / = [a,6] , and suppose that for 
some constant M, |/'| < M almost everywhere on I. Prove that 
|/(6)-/(a)|<M(6-a). 

.2 Let / : [a, 6] —f R be an absolutely continuous function such that 
f'{x) = 0 almost everywhere on J = [a, 6] . Give two proofs that / is 
a constant function. (For one proof use the Vitali Covering Theorem.) 

.3 Let /, F be continuous on / = [a, 6], and suppose there exists a count- 
able subset D of I such that |/'(a;)| < F'{x) for all x G I\D. Show 
that 1/(6) — /(a)| < F{b) — F{a). (We may assume that D is count- 
ably infinite. Let di,d 2 ,--- be a one-one mapping of N’'" onto D. 
Given £ > 0, let X be the set of all points x G I such that 
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for all ^ G [a, x), and let s = supX. Assume that s < b, and derive a 
contradiction.) 

.4 Let / be continuous on / = [a, b] , and suppose there exists a count- 
able subset D of I such that f'{x) = 0 for all x € I\D. Prove that / 
is constant on I. 

.5 Let C be the Cantor set (see Exercise (1.3.8: 11)). Show that [0, 1]\C 
is a countable union of nonoverlapping open intervals whose 

lengths sum to 1, and that C has measure zero. 

For each x = G C define E(a;)= Show 

that 

(i) if X has two ternary expansions, then they produce the same 
value for F{x), so that F is a function on C; 

(ii) F is a strictly increasing, continuous mapping of C onto [0, 1] ; 

(iii) C is uncountable; and 

(iv) F extends to an increasing continuous mapping that is con- 
stant on each J„, equals 0 throughout (— oo,0], and equals 1 
throughout [l,oo). 

Prove that for each <5 > 0 there exist finitely many points 



ai < 0 < &i < 02 < • • • < bj\[—i < Ojv < 1 < b]\[ 
of [-1, 2] such that C C lj^=i [^n, bn] , 

N 

^ (F(6„) - F(o„)) = 1, 

n— 1 

and '^n=i — ctn) < b. (Thus F is increasing and continuous, but 

not absolutely continuous, on [—1,2] .) 

Finally, show that F'(x) = 0 for all x G [0, 1]\C, but F(l) > F(0). 



The last two exercises deserve further comment. Consider a continu- 
ous function F on [0, 1] whose derivative exists and vanishes throughout 
[0, 1]\F. If F is countable, then Exercise (2.1.8: 4) shows that F is con- 
stant. On the other hand. Exercise (2.1.8: 5) shows that if F is uncountable 
and of measure zero, then F need not be constant; but if, in that case, F 
is absolutely continuous, then it follows from Exercise (2.1.8: 2) that it is 
constant. 
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Although the derivative of a function / may not exist at a point a: G R, 
one or more of the following quantities — the Dini derivates of / at x — may: 



D+f{x) 

D+f{x) 

D-f{x) 

D-f{x) 



lim sup 

/1-S-0+ 

lim inf 
lim sup 
lim inf 



f{x + h) - f{x) 
h 

f{x + h)~ f{x) 
h 

f{x + h) - f{x) 
h 

f{x + h)- f{x) 
h 



We consider D'^f{x) to be undefined if 

— either there is no > 0 such that / is defined throughout the interval 
[x, x + h] 

— or else {f{x + h) — f{x)) //iremains unbounded as /i — f 0+. 

Similar comments apply to the other derivates of /. 



(2.1.9) Exercises 

.1 Prove that D^f{x) > D+f{x) and D~ f{x) > D-f{x) whenever the 
quantities concerned make sense. 

.2 Prove that / is differentiable on the right (respectively, left) at x if 
and only if f{x) = D^f{x) (respectively, D~ f{x) = D_f{x)). 

.3 Let / be a mapping of R into R, and define g{x) = —f{—x). Prove 
that for each a; G R, D'^g{x) = D~ f{—x) and D-g{x) = D+f{—x). 

.4 Let / : [a, 6] — >■ R be continuous, and suppose that one of the four 
derivates of / is nonnegative throughout (a, 6). Prove that / is an 
increasing function on [a, b]. (Show that x i— f{x) + ex is increasing 
for each e > 0.) 

.5 Consider a function / : [a, 6] — f R, and real numbers r, s with r > s. 
Define 



E = {x e {a, b) : D^f{x) > r > s > D_/(x)} . 

Let X be an open set such that E C X and g*{X) < g*{E) + e (see 
Exercise (2.1.3: 3)). Prove that the intervals of the form (x — h,x) 
such that x G E, h > 0, [x — h,x] C X, and f{x) — f{x — h) < sh 
form a Vital! covering of E. Hence prove that for each e > 0 there 
exist finitely many points xi, . . . , Xm of E, and finitely many positive 
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numbers hi, , hm, such that the intervals Jj = (xi — hi, Xi) (1 < 
i < m) form a pairwise-disjoint collection, 

>h*{E)-e 

and 

m 

X! “ ^*)) < + e) . 

i=l 

Again applying the Vitali Covering Theorem, prove that there ex- 
ist finitely many points yi, . . . ,y„ of E D IJl^i finitely many 

positive numbers h'l, . . . ,h'„, such that 

Vk + K < Vk+i {f<k<n-l), 

for each k there exists i such that {yk,Vk + h'jJ C Ji, and 

n 

X! (-f(yk + K) - fivk)) > r - 2e) . 

k=l 

Our next theorem shows, in particular, that the differentiability of the 
function E can be dropped from the hypotheses of Proposition (2.1.7). 

(2.1.10) Theorem. An increasing function / : R — >■ R zs differentiable 
almost everywhere. 

Proof. It suffices to show that the sets 

S' = {x G R : D^f{x) is undefined} , 
T={x€-R:D+f{x)>D_f{x)} 

have measure zero. For, applying this and Exercise (2.1.9: 3) to the increas- 
ing function x — /(— x), we then see that D~f{x) < H+/(x) almost 

everywhere; whence, by Exercises (2.1.9: 1) and (2.1.1: 9), 

D^f(x) < D_f{x) < D~ f{x) < D+f{x) < D+f{x) G R 

almost everywhere. (Note that as / is increasing, Dj^.f{x) and D-f{x) are 
everywhere defined and nonnegative.) Thus the four Dini derivates of / are 
equal almost everywhere. Reference to Exercise (2.1.9: 2) then completes 
the proof. 

Leaving S to the next set of exercises, we now show that T has measure 
zero. Since T is the union of a countable family of sets of the form 

E = {x € (a, b) : D'^f{x) > r > s > D_f{x)} , 

where a < b and r, s are rational numbers with r > s, it is enough to 
prove that such a set E has measure zero. We first use Exercise (2.1.9: 5) 
to obtain 
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(i) finitely many points xi, . . . , Xm of (a, b) , and finitely many positive 
numbers hi,... ,hm, such that the intervals Ji = {xi — hi,Xi) (1 < 
i < m) form a pairwise-disjoint collection, 

>h*{E)-e, 

and 

m 

X! “ ^*)) < + e) ; 

i=l 

(ii) finitely many points yi,...,yn of ED Ul^i finitely many 

positive numbers h[, . . . ,h'„, such that 

yk + K<Vk+i (l<fc<n-l), (4) 

for each k there exists i with {yk,Vk + h'f.) C Ji, and 

n 

tfiyk + K) - fivk)) > r {y*{E) - 2e) . 



For each i with 1 < f < m let 

Si = {k-. {yk,yk + Cfe) c JJ . 

Since / is increasing, it follows from (4) that 

Y + K) - fiVk)) < f{x^) - f(xi - hi). 

keSi 

Thus, as the intervals Ji are disjoint, 

m n 

Y - Y + ^'k) ~ f(yk )) . 

2=1 k—1 



so that 

s{ix*{E) + e) >r{n*{E)-2e). 

Since £ > 0 is arbitrary, it follows that sy,*{E) > ry*{E). But r > s, so we 
must have y*{E) = 0. □ 

We make good use of the following consequence of Theorem (2.1.10). 

(2.1.11) Fubini’s Series Theorem. Let {En) he a sequence of increas- 
ing continuous functions on R such that F(x) = Yl^=iEn{x) converges 
for all X G R. Then almost everywhere, F is differentiable, Y^'^=iEn{x) 
converges, and F'{x) = En(x). 
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Proof. Fix real numbers a, b with a < b. It suffices to prove that 
F'{x) = Pn{^) almost everywhere on / = [a, b]: for then we can apply 

the result to the intervals [—n,n] as n increases through N’*'. If necessary 
replacing F„ by — Fn{a), we may assume that F„(a) = 0. Write 

s„(a;) = Fi(x) H \- Fn{x) (x € I) 

and note that F — Sn = is increasing and nonnegative. By 

Theorem (2.1.10), s„ is differentiable on I\An for some set An of measure 
zero; likewise, F (which is clearly increasing) is differentiable on I\Aq for 
some set Aq of measure zero. Then 



OO 

^ An 

n— 0 

has measure zero, by Exercise (2.1.1: 9). Since both F — Sn+i and s„+i — s„ 
are increasing functions, for each x G I\A we have 

s'nix) < 4+1 (a;) < F'{x). (5) 

It follows from the monotone sequence principle that ^n(x) converges 

to a sum < F'{x). 

Now choose an increasing sequence of positive integers such that 

for each k, 

0<F(6)-s„,(6)<2-^ 

Since F — Sn^ is an increasing function, for each x G I we obtain the 
inequalities 

0<F(x)-s„,(x) <2-^ 

Hence (F(x) — Sn^,(x)) converges, by comparison with 

Applying the first part of the proof with Fk replaced by F — s „^ , we now 
see that, almost everywhere on I, ~ converges and 

therefore 

lim (F'{x) - 4,(4) = 0- 

fe-s-oo 

It follows from (5) that 

OO 

F'{x) = lim Sn{x) = Fnjx) 

n—^oo * ^ 

n—\ 



almost everywhere on I. □ 

(2.1.12) Exercises 

.1 Let / be an increasing function on [a, b], and for each positive integer 
n define 



Sn= {x G (a, b) : D^f{x) > n} . 
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Prove that 

(/(&)- /(a)) 

and hence that the set of those x G (a, b) at which D'^f{x) is 
undefined has measure zero. (Use the Vitali Covering Theorem to 
show that there exist finitely many points xi,X 2 , ■ ■ ■ ,Xm of (a,b), 
and positive numbers h\,h 2 , ■ ■ ■ , hm, such that Xk + hk < Xk+i and 
f{xk + hk) - f{xk) > nhk.) 

.2 Let if be a bounded subset of R that has measure zero, and let a be 
a lower bound for E. For each positive integer n choose a bounded 
open set An D E such that ^*{An) < (this is possible by Exercise 
(2.1.3: 3)), and define 



I 0 if a; < a 

fn{x) = < 

[ {An r\ [a, x]) if X > a. 



Show that 

(i) / = fn is an increasing continuous function on R; 

(ii) D^f{x) is undefined for each x G E. 

.3 Prove that if / has bounded variation on [a, b] , then it is differentiable 
almost everywhere on [a,b\. (The converse is not true: see Exercise 
(1.5.15: 4).) 

.4 Let / have bounded variation on [a, 6]. Prove that Tj{a,x) = \ f'{x)\ 
almost everywhere on [a,b]. (Using Lemma (1.5.17), construct a se- 
quence ((/„) of functions on / such that for each n, Tf{a, •) — g„ is 
increasing, 0 < Tj{a, •) — < 2“", and g'n = ±/' almost everywhere. 
Then use Fubini’s Series Theorem.) 

.5 Prove that if a bounded function / is continuous almost everywhere 
on a compact interval I, then it is Riemann integrable. (Let M be a 
bound for |/| on /, let if C / = [a, b] be a set of measure zero such 
that / is continuous on X = I\E, and let £ > 0. We may assume 
that a,b G E. For each x G X there exist arbitrarily small r > 0 such 
that [x,x + r] Cl and 

\f{x') - f{x")\ < {x<x'<x"<x + r). 

2(0 — a) 

The sets [x, a; -I- r] of this type form a Vitali cover of X. With the 
aid of the Vitali Covering Theorem, construct a partition P of I such 
that U{PJ)-L{PJ) <s.) 
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.6 Prove the converse of the last exercise — namely, if a bounded function 
/ : [a, 6] — > R is Riemann integrable, then it is continuous almost 
everywhere on [a,b\. (For each positive integer n define 

An = {x€ [a,b] : uj{f,x) > , 

where uj{f, x) is the oscillation of / at x] see Exercise (1.4.5: 7). Given 
£ > 0, choose a partition P of [a, &] such that U{f,P) — L{f,P) < 
£/2n. Use this to construct a finite set of intervals that cover and 
have total length less than £.) 

Re-examine Exercise (1.5.10: 6) in the light of this result. 



2.2 The Lebesgue Integral as an Antiderivative 

In this section we show how Theorem (2.1.10) and Fubini’s Series Theorem 
(2.1.11) can be used to introduce the Lebesgue integral, a very powerful 
extension of the Riemann integral, as an antiderivative. Our approach^ is 
based on a little-known development by F. Riesz [39]. 

Let / be a nonnegative real-valued function defined almost everywhere 
on R. A function E : R — >■ R is called a Lebesgue primitive of / if it is 
increasing, bounded below, and satisfies F' = f almost everywhere. 

In order to discuss Lebesgue primitives, we first consider the set Pf 
of functions E : R — >■ R that are increasing, bounded below, and satisfy 
F' > f almost everywhere. Note that for such a function, 

E(— oo) = lim F{x) 

x—¥—oo 

exists: indeed, the sequence (E(— n))))T;^, which is decreasing and bounded 
below, converges to a limit which is easily shown to be E(— oo). 

(2.2.1) Proposition. If Pf is nonempty, then there exists an element 
F* G Pf, called an extremal element of Pf, such that 

F4v) - F.iO < F{v) - F{0 ( 1 ) 

whenever ^ < rj and F GPf. 

Proof. First note that the set 

pO = {FGPf. E(-oo) = 0} 



®It is worth comparing this with the development of the Cauchy integral in 
[13]. 
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is nonempty: for if F € Vf, then F — F{—oo) G V^. It is now a 
straightforward exercise to show that 

F^x) = inf {F{x) : F G V°f} 

defines an increasing function F, : R — >■ R°“'" with F*(— oo) = 0. Given 
£ > 0 and real numbers ij with ^ < ry, choose Fi G F° such that Fi(^) < 
F*(^) + £, and consider any element F of Pf. The function F 2 defined by 

f ^i(x) if X < ^ 

J^2(^) = { 

I F(x) + Fi(0 - F(0 if^<x 

belongs to so 

P*(v) < F^iv) 

= F{7j) + F^i0-F{0 
<F{7j) + F40 + £-F{0, 

and therefore 

F4rj)-F,{0<F{v)-F{0+e. 

Since e > 0 is arbitrary, inequality (1) follows. 

Now let be a positive integer, and choose a sequence (F„) in Vj such 
that for each n, 

0<F„(fV)-F,(Af)<2-”. 

For each x G [—IV, N] we have 



F4N) - F4x) < Fn{N) - F„(x) 



and therefore 



0 < F„(x) - F,{x) < Fr,{N) - F,{N) < 2~^. 

Hence the series (Fn ~ F^) of increasing functions converges at each 

point of [-N, A^] , by comparison with Fubini’s Series Theorem 

(2.1.11) now shows that (F^ — F') converges almost everywhere on 

[-N, N], Hence F^ — F^ ^ 0, and therefore F' > /, almost everywhere on 
[-N, Af]. Since the union of a sequence of sets of measure zero has measure 
zero, it follows that F^ > / almost everywhere on R. □ 

(2.2.2) Corollary. Under the conditions o/Proposition (2.2.1), Fi is an 
extremal element ofPf if and only if F^ — Fi is constant on R. 

Proof. If Fi is an extremal element oiPf, and f <rj, then 

FM - F.{f) < Fi(ry) - F,{f) < F,(ry) - F*(^ 
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and therefore 

F4v)-F.{0 = Fi{ri)-Fi{0- 

It follows that 

Fi{x) — F*(a;) = Fi(— oo) — J^*(— oo) 
for all a: G R. The converse is trivial. □ 

(2.2.3) Corollary. Under the conditions o/ Proposition (2.2.1), if f has 
a Lebesgue primitive, then is also a Lebesgue primitive. 

Proof. Let T’ be a Lebesgue primitive of /. Then F G Pf, so by 
Proposition (2.2.1), 

F.(V) - F40 < F(p) - F(0 (f<p). 

Since a finite union of sets of measure zero has measure zero, it follows 
from this inequality and Theorem (2.1.10) that 

f<F:<F' = f 

almost everywhere. Hence Fl = f almost everywhere, and is a Lebesgue 
primitive of /. □ 

We say that a nonnegative function / defined almost everywhere on R is 
Lebesgue integrable (or simply integrable) if there is a bounded Lebesgue 
primitive of /. In that case we define the Lebesgue integral (or simply the 
integral) of / to be 

y ./ = Ft{oo) - T;(-oo), 
where F^. is an extremal element of Pf and 

Tlt(oo) = lim Ft,{x). 

x—^oo 

(The existence of ^*( 00 ) is left as an exercise.) Corollary (2.2.2) shows that 
the value of the integral of / does not depend on the choice of extremal 
element in Pf. Note that 

[ / = sup (F*(y) - F*(a;)) . 

J x<y 

We often write 

y / = y fix) = J /(^) dt = • • • , 

as in elementary calculus courses. 

(2.2.4) Exercises 
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.1 In the notation of the proof of Proposition (2.2.1), prove that F* is 
an increasing function and that F*(— oo) = 0. 

.2 Prove that if some element oi Vf is bounded above and F* is an 
extremal element of Vf, then F, is bounded above and F*(oo) exists. 

.3 Let / be an integrable nonnegative function, and F a Lebesgue primi- 
tive of /. Prove that if F is absolutely continuous on each compact in- 
terval, then it is an extremal element of Vf. (Use Proposition (2.1.7).) 
Is every bounded Lebesgue primitive of / an extremal element of 
Vf? 

.4 Show that if / > 0 is Lebesgue integrable, then 

f f = inf I sup (F(j/) - F{x)) : F G Vf \ . 

J {x<y J 



.5 Let f(x) equal a nonnegative constant c in a bounded interval I, and 
0 outside I. Show that / is Lebesgue integrable, with f f = c\I\ . 

.6 Let / be an integrable nonnegative function. Prove that f f = 0 if 
and only if / = 0 almost everywhere. 

.7 Let / be an integrable nonnegative function such that f f > 0. Prove 
that f(x) > 0 on some set with positive outer measure. (Suppose that 
for all positive integers m and n, 

^ni,n = {a; G [-m,m] : f{x) > 

has measure zero, and use the preceding exercise to obtain a 
contradiction.) 

.8 Let / and g be integrable nonnegative functions such that f > g 
almost everywhere, and let F,G be extremal elements of Vf,Vg, 
respectively. Prove that 

(i) F-G&Vf-g, and 

(ii) f — g is integrable. 

(For (ii) note that F' > g almost everywhere.) 

.9 Let / be an integrable nonnegative function, and F a Lebesgue prim- 
itive of /. Show that if F'(^) = /(^) and s„ < ^ < s„ -I- 2“” for each 
n, then 

hm 2-" f = no- 



(Note Exercise (1.5.1: 3).) 
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(2.2.5) Lemma. Let <i>, 'll, and ^ — he increasing functions on R such 
that <I> is an extremal element of V^' and 'll is bounded. Then 'll is an 
extremal element of 

Proof. Note that <!>' and 'll' are defined almost everywhere, by Proposi- 
tion (2.1.10). Hence 'll G By Proposition (2.2.1), has an extremal 
element 'll*. The function 



0 = $ - T»* 

is increasing, bounded below, and has derivative equal to d>' almost ev- 
erywhere; so it belongs to Since d> is an extremal element of P$/, it 
follows that 'll* — 'll = 0 — <I> is an increasing function. But by our choice 
of 'll* , 'll — 'll* is an increasing function. It follows that 'll — 'll* is constant; 
whence, by Corollary (2.2.2), 'll is an extremal element of V^,'. □ 

(2.2.6) Proposition. If f,g are integrable nonnegative functions defined 
almost everywhere , and A > 0 , then f + g and Xf are integrable, 

J {.f + g) = J f + J ( 2 ) 

and 

Proof. Let be extremal elements of Vf, Tg, respectively. Then 

F* -I- G* is a bounded Lebesgue primitive of / -I- 5 , which is therefore inte- 
grable; but there is no guarantee that f* -I- G* is an extremal element of 
Vf+g, so we have to work harder to establish the identity (2). To this end, 
let iL* be an extremal element of Vf+g. Then if* -I- G* — if* is an increasing 
function. On the other hand, ii* is increasing, and ii' = f + g > g almost 
everywhere; so by our choice of G*, — is increasing. Applying Lemma 

(2.2.5) with d> = ii* and 'll = if* — G*, we see that ii* — G* is an extremal 
element of Pf, whence, by Corollary (2.2.2), — G^. — if* has a constant 

value c. It follows that 

J i.f + 9) = H*(oo) - ii*(-oo) 

= (G*(oo) -I- F*(oo) + c) — (G*(— 00) -I- F*(— 00) -I- c) 

= (F*(oo) — F*(— 00)) -I- (G*(oo) — G*(— 00)) 

It is left as an exercise to deal with Xf. □ 



(2.2.7) Proposition. If (/„) is a sequence of integrable nonnegative 
functions defined almost everywhere, then f = inf /„ is integrable. 
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Proof. For each n choose an extremal element Fltn of , and note that, 
by Corollary (2.2.3), is a Lebesgue primitive of /„. Then G Vf, so 
Vf is nonempty. By Proposition (2.2.1), there exists an extremal element 
F* of Vf, and F^n~F^ is increasing; so (Jf*„ — FV)' > 0 almost everywhere. 
Hence, almost everywhere. 

In = Fi„ >Fi>f, 
so 

/ = inf fn>Fl> /, 

and therefore F^ = /. Moreover, by Exercise (2.2.4: 2), and therefore 
F,, is bounded; so / is integrable. □ 

(2.2.8) Corollary. If f,g are integrable nonnegative functions, then so 
are f V g and f A g. 

Proof. The integrability of / A g is a special case of Proposition (2.2.7); 
that of / V then follows from the identity 

f'd g = f + g- f Ag, 

Proposition (2.2.6), and Exercise (2.2.4: 8). □ 

We now extend the Lebesgue integral to functions of variable sign. We say 
that a real-valued function / defined almost everywhere on R is (Lebesgue) 
integrable if there exist integrable nonnegative functions f\ , /2 such that 
/ = /i — / 2 ; we then define the (Lebesgue) integral of / to be 



(2.2.9) Exercises 

.1 Prove that the foregoing is a good definition — in other words, that if 
fij f 2 , f 3 , fA are integrable nonnegative functions such that /i — /2 = 
fs — fA, then J fi — J f 2 = f fs — f fA - Prove also that if a nonnegative 
function / has a bounded Lebesgue primitive, then it is integrable in 
the new sense, and its integrals in the old and new senses coincide. 

.2 Show that / is integrable if and only if /+ = /V 0 and f~ = (— /) V 0 
are integrable, in which case J f = J f^ ~ J f~ ■ (Choose integrable 
nonnegative functions /i ,/2 such that / = /i — / 2 , and note that 
/+ = /i - /i A /2.) 

.3 Prove that if /, g are integrable and A G R, then f + g and A/ are 
integrable, f (f + g) = f f + f g, and f \f = X f f. (For the last part 
you will first need to complete the proof of Proposition (2.2.6).) 
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.4 Prove that if f,g are integrable functions such that f > g almost 
everywhere, then J f > f g- 

.5 Prove that if / is integrable, then so is |/| , and |/ /| < f |/| . 

.6 Show that if / and g are integrable, then so are f \/ g and f A g. 

(Reduce to the case where / and g are nonnegative.) 

.7 Let {fn)^=o be a sequence of integrable functions. Prove that 

(i) if /„ > /o almost everywhere, then inf„>i /„ is integrable; 

(ii) if /„ < /o almost everywhere, then sup„>j^ /„ is integrable. 

.8 Let / be a step function — that is, a function, defined almost 
everywhere on R, for which there exist points 



a = Xi < X2 < ■ ■ ■ < Xn = b 

and real numbers ci, . . . , c„_i such that 

{ Ci if Xi < X < Xi+i 

0 if x < a or a; > 5. 

Give two proofs that / is integrable and that 

/ n— 1 

^ Ci{Xi+i - Xi). 

.9 Let / be integrable, t a real number, and g{x) = f{x+t). Prove that g 
is integrable, with f g = f f- ( Translation invariance of the Lebesgue 
integral. First consider the case where / is nonnegative. Let F* be an 
extremal element of Vf, and define G*(x) = F'*(a; + t); prove that G* 
is a bounded Lebesgue primitive of g, and that f g <//■) 

Let A be a subset of R. The characteristic function of A is the mapping 
Xa : R — >■ R defined by 



r 1 ii x & A 

Xa{x) = I 

[ 0 ii x ^ A. 



A function / defined almost everywhere is said to be integrable over A if 
fXA is integrable, in which case we define 
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If A is a compact interval [o, b] , we write / for f. If ^ is a closed 
infinite interval, we use the natural analogous notations; for example, if 
A = [a, oo), we write /“ / for /. 

(2.2.10) Proposition. If f is an integr able function, then f is integrable 
over any interval. Moreover, if f is nonnegative and F* is an extremal 
element ofVf, then 

[ f = F4b)-F4a) 

J a 

whenever a < b. 

Proof. We only discuss the case where / is nonnegative and the interval 
is of the form I = [a, b] with a < b. Accordingly, we define 

Ft (a) if a; < a 
F(x) = < F*(a:) if a < x < & 

F* (6) if a; > &. 

Then F is a Lebesgne primitive of fxi and so belongs to F/x/- We show 
that F is an extremal element of V/xi- Let G G 'Pfxw and for each pair of 
real numbers a, fS with a < (3 define 

F^{x) + G{a) — F^{a) \i x < a, 

Ha,0{x) = < G{x) \i a < X < !3, 

F*(a:) + G(/3) - F*(/3) \i x> (3. 

Note that ii a < a < (3 < b, then G F/. Consider real numbers f,,rj 
with ^ < ? 7 . If ?7 < a or ^ > 6, then F(? 7 ) = F{f) and so 

F{r,) - F{i) < Gfn) - G{i) (3) 

holds trivially. If a < ^ < ?7 < &, then 

F{7j) - F(0 = FM - 

= G{v)-G{0- 

If f < a and r] > b, then, as Ha^b & F/ and G is increasing, 

F{g) - F{0 = F,{b) - F,{a) 

< HaAb) - HaAA 
= G{b) - G{a) 

< G{A - G(?). 




2.2 The Lebesgue Integral as an Antiderivative 101 



Hence (3) holds in all possible cases, so F is an extremal element of V/xi- 
Since F is bounded by F*, / is integrable over I and 



f f = F{oo) - F{-oo) = F^{b) - F,(a). 
J a 



□ 



(2.2.11) Exercises 



.1 



Let / be an integrable nonnegative function, and F* an extremal 
element of Vf. Prove that for each x S R, / is integrable over (— oo, x] 
and 



/ / = F,(x) 



F,(— oo). 



.2 Complete the proof of Proposition (2.2.10) in the remaining cases. 

.3 Let / be a nonnegative integrable function such that / = 0 for 
each X G R. Prove that / = 0 almost everywhere. 

.4 Find expressions for XAnBi Xaub, and x. 4 \b in terms of XA and xb- 
Prove that if / is integrable over both A and B, then it is integrable 
over A n F, A U F, and A\B. Prove also that 

(i) if A and F are disjoint, then / = J^f + f; 

(ii) if F C A, then f = lAf-JBf- 

.5 Let / be a nonnegative integrable function, [a, b] a compact interval, 
and m a real number such that /(x) > m for each x G (a, b) . Give two 
proofs that f > m{b— a). (For one proof use Proposition (2.1.7).) 

.6 Let / be a nonnegative integrable function, F a bounded Lebesgue 
primitive of /, and [a, b] a compact interval. Must we have f = 
F(6)-F(a)? 



The power of the Lebesgue integral only appears when we consider the 
interplay between the operations of integration and of taking limits. There 
now follows a string of results and exercises that deal with this topic. 

A sequence {fn)'^=i of real-valued functions defined almost everywhere 
is said to be increasing (respectively, decreasing) if /i < /2 < • • • 
(respectively, /i > /2 > • • •) almost everywhere. 

(2.2.12) Beppo Levi’s Theorem. Let (/„) he an increasing sequence 
of integrable functions such that the corresponding sequence of integrals 
is bounded above. Then (/„) converges almost everywhere to an integrable 
function f, and J f = lim„_>oo / fn- 
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Proof. Replacing /„ by /„ — /i if necessary, we may assume that /„ > 0. 
Choose M > 0 such that f fn < M for each n. By Exercise (2.2.11: 1) and 
Corollary (2.2.2), 

Fn{x) = ( fn 

J — OO 

defines an extremal element Fn of P/„- Since 

fn\(^ — OD,x] — /n+lX(— C 30 ,a:] — 

it follows from Exercise (2.2.9: 4) that {Fn{x))'^^i is an increasing sequence 
that is bounded above by M and therefore converges to a limit F{x) < 
M . Since each F„ is an increasing function, so is F; whence, by Theorem 
(2.1.10), F is differentiable almost everywhere. 

If m > n, then F^ = fm > fn almost everywhere, so Fm G P/„. Thus if 
X < y, then 

Fn{y) - Fn{x) < F„i{y) - Fm{x)-, 
letting TO — >■ OO, we obtain 



Fn{y)-Fn{x)<F{y)-F{x). 

It follows that F' > Ff = fn almost everywhere, which ensures that, almost 
everywhere, the increasing sequence (/„) converges to a limit / satisfying 

/ = sup/„ < F'. 

Since F' is integrable {F is a bounded Lebesgue primitive of F') , it follows 
from Exercise (2.2.9: 7) that / is integrable. Finally, by Exercises (2.2.4: 4) 
and (2.2.9: 4), 

F(oo) - F(-oo) > J F' > J f ^ J fn 

= F„(oo) - F„(-oo) 

— >■ F{oo) — F{—oo) as n — >■ oo, 



so 



J f = F{co) - F(-oo) 



lim 



fn- 



□ 



(2.2.13) Exercises 

.1 Let a G R, and define 

fix) 



if X > 0 
0 if X < 0. 



Prove that / is integrable over [l,oo) if and only if a < 1, and that 
/ is integrable over [0, 1) if and only if a > 1. Calculate f f in each 
case. 
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.2 Define f{x) = e where a is a positive constant. Prove that / is 
integrable, and calculate / /. 

.3 Let / be an integrable function, and I a bounded interval. Use Beppo 
Levi’s Theorem to prove that / is integrable over I. (Consider the 
sequence (/ A where gn{x) = n ii x € I, and gn{x) = 0 

otherwise.) Extend this result to an unbounded interval I. (First take 
/ > 0. Consider the sequence (/«), where fn{x) = f{x) if x G I (1 
[—n,n] , and /„(x) = 0 otherwise.) 

.4 Prove Lehesgue’s Series Theorem: if /« is a series of integrable 
functions such that the series / l/«l converges, then /« 

converges almost everywhere to an integrable function, and 

/ OO OO « 

71 = 1 '' 

(Consider the partial sums of the series fn Yl^=i fn-) 

.5 Use the preceding exercise to give another proof that if / is a non- 
negative integrable function satisfying f f = 0, then / = 0 almost 
everywhere (See also Exercises (2.2.11:3) and (2.2.4: 6).) 

.6 Let (A„) be a sequence of subsets of R, and / a function that is 
integrable over each An, such that /a converges. Prove 

that 



(i) / is integrable over A = U“=i A„, and |/| < X)“=i /a„ I/I 5 

(ii) if also the sets A„ are pairwise-disjoint, then f^f = Ia /■ 



.7 Let / be an integrable function, and e > 0. Show that there exists 
a bounded interval I such that /R\7 I/I < £. (Consider \ f\xn, where 
Xn is the characteristic function of [— n, n ] .) 

.8 Prove that the series ^ converges for each x > 0. Define 



fix) 



ifx>0 
0 if X < 0. 



Prove that / is integrable, and that / / = 

.9 Let (/„) be a sequence of integrable functions such that 0 < /i < 
/2 < • • ■ almost everywhere. Show that lim„_>oo f fn = 0 if and only if 
lim„_>oo fn{x) = 0 almost everywhere. (For “only if” choose a subse- 
quence ifnk)T=i such that / fn^ < 2“^ for each k, and use Lebesgue’s 
Series Theorem.) 




104 



2. Differentiation and the Lebesgue Integral 



.10 Let (/„) be a sequence of step functions such that 0 < /„+i < /„ 
almost everywhere and lim„_>oo / /n = 0. Without using any of the 
foregoing theorems or exercises about the convergence of integrals, 
prove that lim„_>oo fn{x) = 0 almost everywhere.^ (Use the Vital! 
Covering Theorem.) 

.11 Prove Fatou’s Lemma: if (/„) is a sequence of nonnegative integrable 
functions that converges almost everywhere to a function /, and if 
the sequence (/ is bounded above, then / is integrable and 

y./'<liminf J fn- 



(Apply Beppo Levi’s Theorem to the functions = inffc>„ fk-) 

.12 Let / be defined almost everywhere, and suppose that for each e > 0 
there exist integrable functions g, h such that g < f < h almost 
everywhere and J{h — g) < e. Prove that / is integrable. (For each 
n choose integrable functions g„,h„ such that gn ^ f ^ hn almost 
everywhere and / (/i„ — (?„) < 2“”.) 

.13 Prove that if if is a set of measure zero, then there exists a non- 
negative integrable function / such that / / = 0 and f{x) = 1 for 
all X G E. (For each positive integer n choose a sequence {In,k)T=i 
of pairwise-disjoint bounded open intervals such that E C A„ = 
U^i In,k and \In, k\ < 1/n. Let / be the characteristic function 

Let /, g be functions defined almost everywhere. We say that g dominates 
/ if I/I "Eg almost everywhere. 

(2.2.14) Lebesgue’s Dominated Convergence Theorem. Let {fn) 
be a sequence of integrable functions that converges almost everywhere to 
a function /, and suppose that there exists an integrable function g that 
dominates each fn- Then f is integrable, and J f = lim„_,.oo / fn- 

Proof. The functions 

g„ = sup fk 

k'>n 

are integrable, by Exercise (2.2.9: 7), and form a decreasing sequence con- 
verging to / almost everywhere. Noting that f (—gn) < / 5 ) we now apply 



^This is the basic result in another approach to Lebesgue integration on R, 
which starts by defining the integral of a step function and then considers the 
convergence of a sequence (fn) of step functions when the corresponding sequence 
of integrals is bounded above; see [40] . 
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Beppo Levi’s theorem to the sequence (— 5 «) to show that / is integrable 
and that /(/„—>■ / /. Replacing /„ by —f„ in this argument, we see that 
J J f, where 

hn = inf fk- 

k^n 

Finally, hn < fn < On, so 



/ 



hn ^ 



fn< 



9ny 



and therefore J fn ^ J f- O 



(2.2.15) Exercises 

.1 Prove that if / is an integrable function, then J (/ A n) — >■ / / as 
n — >■ oo. 

.2 Let / be an integrable function, and for each n define /« = (/ A n) V 
—n. Prove that / 0 as n — >■ oo. 

.3 Give two proofs that if / is an integrable function, then f (|/| A n“^) 
— 0 as n — >■ oo. 

.4 Give an example of a sequence (/„) of integrable functions such that 
lim„_>oo /n = 0 almost everywhere, lim„_>oo / /« = 0, and there is no 
integrable function that dominates each /„. 

.5 Let (fn) be a sequence of integrable functions converging almost ev- 
erywhere to a function /, and let g be an integrable function that 
dominates /. Show that / is integrable, and that J f = lim„_>oo / fn- 
(Gonsider the functions {fn A g) V —g.) 



With the help of Lebesgue’s Dominated Gonvergence Theorem we can 
prove the converse of Exercise (2.2.4: 3), and thereby, for a nonnegative in- 
tegrable function /, complete the characterisation of the extremal elements 
of Vf among the Lebesgue primitives of /. 

(2.2.16) Proposition. If f is a nonnegative integrable function, then 
each extremal element of Vf is absolutely continuous on each compact 
interval. 

Proof. Let I = [a, b] be a compact interval. Given an extremal element F 
ofVf, consider first the case where / is bounded above almost everywhere 
by some constant M > 0. The function x i— >■ Mx is increasing and has 
derivative M > f almost everywhere. It follows from Proposition (2.2.1) 
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that F{r]) — F{^) < M{rj — ^) whenever ^ < rj. So if ([a^, bk])^^i is a finite 
sequence of nonoverlapping subintervals of I, then 

n n 

Y^\F{b,)-F{a,)\<MY,{b, - a,k), 

k=l k=l 

from which the absolute continuity of F readily follows. 

In the general case we define 



fn = U -n 



for each positive integer n. Given e > 0, we see from Exercise (2.2.15: 2) that 
there exists N such that J \ f — /n\ < Choose an extremal element Fjv 
of Vfpf If is a finite sequence of nonoverlapping subintervals 

of I, then by Proposition (2.2.10), 



k=l 



J2\F{bk)-F{ak)\ = J2\ / 

fc=i r 
” fbk 

- 

fc=l 

<'^\FN{bk) - FN{ak)\ + / \f - Jn\ 

7- T J a 



E/ I/-/-I 

k^l 

rb 



n 

< E \^N{bk) — FN{ak)\ + s. 

fc=i 



Since, by the first part of the proof, Fn is absolutely continuous, it follows 
that F is absolutely continuous. □ 



(2.2.17) Exercises 

.1 Let / be an integrable function, and F the function defined by 

P{^) = f /• 

J — OO 

Prove that F is absolutely continuous on each compact interval. 

.2 Prove that if G : R — >■ R is absolutely continuous on each com- 
pact interval, then there exists an integrable function g such that 
G' = g almost everywhere. (Note that for a < x, G{x) = Tc{a,x) — 
{Ta{a,x) - G{x )) .) 

.3 Let / be a nonnegative continuous function on a compact interval 
I = [a, b] , and extend / to R by setting f{x) = 0 for all x outside I. 
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Prove that / is integrable over [a,b], and that /^ / = /^/, where f 



, (Let 




0 


if X < a 


f/ 


if a < X <b 


If 


if X > b, 



and show that F is absolutely continuous.) 



Two fundamental techniques of calculus are changing the variable in an 
integral, and integration by parts. We now deal with the former, the latter 
being left to the next set of exercises. 

(2.2.18) Proposition. Let g be an absolutely continuous, increasing 
function on I = [a, /3] , a = g{a), b = g{(3), and / an integrable function 
on [a, 6]. Then {f o g)g' is integrable, and 

rP 

/ = / if°g)9'- 

Ja 




Proof. We may take f = g = 0 outside [a, 6] . By considering /+ and 
f~ separately, we reduce to the case where / is nonnegative. Moreover, 
we may assume that / is bounded: for if we have proved the proposition 
in the bounded case, we obtain the desired result in the general case for 
nonnegative / by considering / An, letting n — 1 oo, and using Beppo Levi’s 
Theorem. 

Note that, by Corollary (1.4.12), g maps [a,/3] onto [a, 6]. Choose M 
such that 0 < / < M, and let F be an extremal element of P/. Then the 
function 

G = F o g 

is increasing. Since t ^ Mt belongs to V f, ii a < f < g < P, then 

G{g) - G(0 = F{g{g)) - F{g{0) < Mg{g) - Mg{f)-, (4) 

so the function Mg — G is increasing. Since, by Exercise (2.2.4: 3), Mg is 
an extremal element of VMg'i we can apply Lemma (2.2.5) with ^ = Mg 
and 41 = G, to show that G is an extremal element of Vg’', whence 

/' / = m - F{a) = G{p) - G{a) = G'. 

J a Ja 

It therefore remains to prove that 

G'{t) = f{g{t))g'{t) (5) 
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almost everywhere in I. 

Consider the set of those t G {a, (3) for which (5) fails to hold. This may 
be split into five subsets, as follows: 



• the set Ax of measure zero on which G'{t) does not exist; 

• the set A 2 of measure zero on which g'{t) does not exist (see Exercises 
(2. 1.6:4) and (2.1.12:3)); 

• the set A 3 of measure zero on which F'{g{t)) does not exist; 

• the set A 4 of measure zero on which f{g{t)) does not exist; 

• the set B of those t G I\ Ut=i such that F'{g{t)) ^ f{g{t)). 



To complete the proof for bounded nonnegative /, we show that B has 
measure zero, li t G B and g'{t) = 0, then it follows from (4) that 



\G'{t)\ < lim 

h—yO 



\Mg{t + h) - Mg{t)\ 
\h\ 



Mg\t) = 0, 



so (5) holds. Let 



C = {t G B \ g'{t) exists and is nonzero, and F'{g{t)) ^ f{g(t))} . 



Since E is a Lebesgue primitive of /, g{G) has measure zero; we must show 
that G itself has measure zero. To this end, for all positive integers to, n 
let Gm,n be the set of those t G G such that A a < ti < t < t 2 < (3 and 
g{t 2 ) - g{ti) < 1 /to, then 

g{t 2 ) - g{ti) > - — —■ 
n 

Then (Exercise (2.2.19: 1)) C = Um n=i C'm.n, so we need only prove that 
for fixed m and n, Cm,n has measure zero. Since Gm,n C C, g{Cm,n) has 
measure zero; so for each e > 0 there exists a sequence ([ofe, of 

compact subintervals of (a, b) such that 



(i) bk — ak < 1 /to for each k, 

(ii) g{Cra,n) C (J^i [afc,&fc] , and 

(hi) - ak) < e/n. 

Clearly, we may assume that g{Gm,n) H (ak,bk) is nonempty for each k. 
Since g is continuous and increasing, it follows from the Intermediate Value 
Theorem that each [ 0 ^, 6 ^] is the image under g oi a, compact subinterval 
[oik,(3k] of [a, j3] . For each k choose t G Gm,n with ak < t < (3k- Since 

g{Pk) - g{ak) = bk - ak < 
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the definition of Cm,n ensures that 



h-ak> 



Pk — Oik 



Thus the intervals [ak,Pk] cover Cm,n and have total length 

OO OO 

- ttfe) < ^ n{bk - ak) < s. 

k=l k=l 

Since e is arbitrary, it follows that Cm,n has measure zero. □ 



(2.2.19) Exercises 



.1 In the notation of the proof of Proposition (2.2.18), show that C = 

[Jm,n=l ^m,n- 

.2 This exercise deals with integration by parts. Let /, g be integrable 
functions, and I = [a, b] a compact interval. For a < x < b define 

F{x)=rf, G{x)= f\. 

J a J a 

Prove that the functions Fg and /G, extended to equal 0 outside /, 
are integrable over / and that 





fG = F{b)G{b) - F{a)G{a). 



The final set of exercises in this section explores further the relation 
between Riemann and Lebesgue integration. For this purpose, we again 
denote the Riemann integral by f. 

(2.2.20) Exercises 

.1 Let the bounded function / be Riemann integrable over the compact 
interval I = [a, b] . Show that for each e > 0 there exist step functions 
g, h that vanish outside I, such that g < f < h, 

and J {h — g) < e. Then use Exercise (2.2.13: 12) to deduce that / 
is Lebesgue integrable over I and that the Lebesgue and Riemann 
integrals of / over I are equal. 
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.2 Define / : [0, 1] — >■ R by 

fix) = 



1 if X is irrational 
0 if X is rational. 



Show that /, which we have already shown is not Riemann integrable 
(Exercise (1.5.10: 6)), is Lebesgue integrable over [0, 1], with / = 1. 

.3 Let / be a bounded nonnegative function on R that is Riemann 
integrable over each compact interval, such that the infinite Riemann 
integral J = lim„_>oo Jflnf exists. Prove that / is Lebesgue integrable 
and that its Lebesgue integral equals J. Need this conclusion hold if 
/ is allowed to take negative values? 

.4 Let (/„) be an increasing sequence of Riemann integrable functions 
over a compact interval [a, b] , such that /(x) = lim„_>oo fnix) defines 
a Riemann integrable function over [a, b]. Prove that 





2.3 Measurable Sets and Functions 

A function / defined almost everywhere on R is said to be measurable if it 
is the limit almost everywhere of a sequence of integrable functions. Clearly, 
an integrable function is measurable. 

(2.3.1) Proposition. If a measurable function is dominated by an 
integrable function, then it is integrable. 

Proof. Let g be an integrable function dominating a measurable function 
/, and choose a sequence (/«) of integrable functions converging to / almost 
everywhere. For each n define 

gn = ifn A g) V -g. 

Then is integrable, by Exercise (2.2.9: 6), and is dominated by g; also, 
lim = (/ A 5) V -5 = / 

n—^oo 

almost everywhere. It follows from Lebesgue’s Dominated Convergence 
Theorem (2.2.14) that / is integrable. □ 

(2.3.2) Corollary. A measurable function f is integrable if and only if 
I/I is integrable. 
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Proof. If I/I is integrable, then, as it dominates /, we see from Propo- 
sition (2.3.1) that / is integrable. For the converse we refer to Exercise 
(2.2.9: 5). □ 

(2.3.3) Exercises 

.1 Prove that if / is a measurable function and I is an interval, then 
fXi is measurable. 

.2 Prove that a continuous function / : R — >■ R is measurable. 

.3 Let /, g be measurable functions. Prove that f + g, f — g, f'^g, and 
f A g are measurable. 

.4 Let (/„) be a sequence of measurable functions that converges almost 
everywhere to a function /. Prove that / is measurable. (For each k 
define the step function gk by 

f k if — k < X < k 

9k{x) = < 

[ 0 otherwise. 

First prove that (/ A gt) V —gk is integrable.) 

.5 Let / be a measurable function, and p a positive number. Prove that 
l/l^ is measurable. 

.6 Give an example of a measurable function / which is not integrable 
even though is. 

.7 Give two proofs that the product of two measurable functions is 
measurable. (For one proof use Exercises (2.3.3: 3 and 5).) 

.8 Let the measurable function / be nonzero almost everywhere. Prove 
that 1// is measurable. (First consider the case where / > c almost 
everywhere for some positive constant c. For general / > 0 consider 
fn = l/{f + n-^).) 

.9 Let / be a measurable function, and (/? : R — >■ R a continuous func- 
tion. Prove that (fof is measurable. (Reduce to the case where / van- 
ishes outside a compact interval [o, b] . Then use Exercise (2.2.4: 9) to 
construct a sequence (/„) of step functions that vanish outside [o, 6] 
and converge almost everywhere to /.) 

.10 Let —2 < a < —1, and define 

sin X if X > 0 
0 if X < 0. 



Prove that / is integrable over (0, oo). 
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.11 Define 



fix) = 



sin X 



0 



if a; > 0 



if a; < 0. 



Prove that / is measurable but not integrable. (For the second part 
suppose that / is integrable, so |/| is integrable. Use the inequality 



i/i>E 



/•2n'7r+7r/3 



/ (iVGN+) 



n=l • 

to derive a contradiction.) 



.12 Let a > 0, and define 



fix) 



e ®a;“ ^ if a; > 0 
0 if a; < 0. 



Prove that / is integrable. (Consider the functions fx{-oo,i] and 
/X(i.oo) separately.) 

.13 Give two proofs of the Riemann-Lehesgue Lemma: if / is an integrable 
function, then the functions x i— f{x) sinna; and x i— f{x) cosnx are 
integrable, and 



lim 



f{x) sinna; da; = 0, 



lim 

n— >-oo 



f{x) cosna;da; = 0. 



(One proof proceeds like this. First reduce to the case where / > 0 
and / vanishes outside a compact interval I = [— iV7r, Ntt] for some 
positive integer N. Let F be an extremal element oi Vf, and carry 
out integration by parts on F'{x) sin nx dx.) 



.14 Let ip, 4!, 9 be nonnegative bounded integrable functions on / = [0, c] 
such that 



Prove that 



9ix) < p{x) + [ i/’(t)6*(f) dt (x G /). 

Jo 

9ix) < p{x) + J (p{t)4it) exp i/>(s) ds^ dt (x G /). 



(Define 



7 (x) = f 4{t)9{t)dt, 

J 0 

A(x) = y(x) exp J tpi^) • 
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Show that 

A'(x) < (p{x)ip{x) exp J 

almost everywhere on /, and then use Proposition (2.1.7).) 

A subset A of R is called a measurable set (respectively, integrable set) if 
Xa is a measurable (respectively, integrable) function. A measurable subset 
of an integrable set is integrable, by Proposition (2.3.1). 

If A C R is integrable, we define its (Lebesgue) measure to be fi{A) = 
f XA- 

(2.3.4) Exercises 

.1 Let A, B be measurable sets. Prove that A U i?, AnB, and A\B are 
measurable. 

.2 Let (A„) be a sequence of pairwise-disjoint measurable sets. Prove 
that 

(i) U^i is measurable; 

(ii) if l^{An) is convergent, then A„ is integrable, and 

.3 Prove that any interval in R is measurable. 

.4 Let B be the smallest collection of subsets of R that satisfies the 
following properties. 

— Any open subset of R is in B. 

— liA&B, then R\A G B. 

— The union of a sequence of elements of B belongs to B. 

The elements of B are called Borel sets. Prove that any Borel set is 
measurable. 

If O is a binary relation on R and /, g are functions defined almost 
everywhere on R, we define 

1/^5] = {x G R : f{x)Og{x)}. 



So, for example. 



1/ > 5 ] = {x G R : /(x) > g{x)} . 



We also use analogous notations such as 



[a < / < ^1 = {x G R : a < /(x) < b} . 
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Just as the measurability of a set is related to that of a corresponding 
(characteristic) function, so the measurability of a function is related to 
that of certain associated sets. 

(2.3.5) Proposition. Let f be a real-valued function defined almost ev- 
erywhere. Then f is measurable if and only if |/ > r] is measurable for 
each r G R. 

Proof. Suppose that / is measurable, let r G R, and for each positive 
integer n define 

r _ {f-r) + 

i + if-r)+- 

Since the functions 1 1— 1 f*" and 



are continuous on R and R°“*", respectively, we see from Exercises (2.3.3: 2 
and 9) that /„ is measurable. But lim„_>oo fn = X[/>r] almost everywhere, 
so 1/ > r] is measurable, by Exercise (2.3.3: 4). 

Now assume, conversely, that |/ > r] is measurable for each r G R. 
Given a positive integer n, choose real numbers 

...,r_ 2 ,r_i,ro,ri,r 2 ,... 

such that 0 < Vk+i — rk < 2“” for each k. Then 

< / < rfcl = 1/ > rfe_i]\|/ > rfc] 

is measurable, by Exercise (2.3.4: 1); let Xk denote its characteristic 
function. The function 

OO 

fn — ^ ^ '^k—lXk 

k— — oo 

is measurable: for it is the limit almost everywhere of the sequence of 
partial sums of the series on the right-hand side, and Exercises (2.3.3: 3 
and 4) apply. To each x in the domain of / there corresponds a unique k 
such that rk-i < f{x) < then 

0 < f{x) - fk{x) < rfe - rk-i < 

Hence the sequence (/„) converges almost everywhere to /, which is 
therefore measurable, again by Exercise (2.3.3: 4). □ 

The exercises in the next set extend the ideas used in the proof of Propo- 
sition (2.3.5). In particular, when taken together with the subsequent dis- 
cussion of measurability in the sense of Caratheodory (a concept defined 
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shortly), the second and third exercises link our approach to integration 
with the one originally used by Lebesgue; see [40], pages 94-96. 

(2.3.6) Exercises 

.1 Let / be a function defined almost everywhere on R. Prove that the 
following conditions are equivalent. 

(i) / is measurable. 

(ii) [/ > is measurable for each r. 

(iii) [/ < is measurable for each r. 

(iv) 1/ < ?’l is measurable for each r. 

(v) |r < / < i?] is measurable whenever r < R. 

.2 In the notation of the second part of the proof of Proposition 
(2.3.5), prove that if / is nonnegative and integrable, then each /„ is 
integrable and lim„_>oo J fn = J f- 

.3 Let / be a nonnegative measurable function vanishing outside the 
interval [a,b]. For the purpose of this exercise, we call a sequence 
(r„)))^g of real numbers admissible if rg = 0 and there exists S > 
0 such that r„+i — r„ < 5 for all n; and we say that the series 
r„/x(£'„) corresponds to the admissible sequence, where 
whose characteristic function we denote by y„, is the measurable set 
[?'n-i < f < ?’nl- Suppose that this series converges. Let (r(,)))Lg 
be any admissible sequence for /, and let x'n be the characteristic 
function of E'^ = \r'^_i < f < r],]. Prove that 

(i) the series ^nXn converge almost every- 

where to integrable functions, 

(ii) E“=i r'n-iXu < f < E“=i ’■"Xn almost everywhere, 

(iii) the series ^n-iKE'n) and X)T=i rnK^n) converge, and 

(iv) EZl<-MK) <EZirnKEn). 

Hence prove that if i"nii{En) converges for at least one admis- 

sible sequence (r„), then / is integrable, and / / is both the infimum 
of the set 

OO 

rnKEn) ■■ (r„) is admissible, Vn (E„ = |r„_i < / < r„]) 

n— 1 

and the supremum of the set 



r„_i^(E„) : (r„) is admissible, Vn (E„ = |r„_i < / < r„]) 
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.4 By a simple function we mean a finite sum of functions of the form cy, 
where c G R and x is the characteristic function of an integrable set. 
Let / be a nonnegative integrable function. Show that there exists a 
sequence (/„) of simple functions such that 

(i) 0 < /n < / for each n, 

(ii) / = fn almost everywhere, and 

(iii) // = E“i//n- 

(First reduce to the case where / is nonnegative and vanishes outside 
a compact interval. Then use the preceding exercise to construct fk 
inductively such that f (^f ~ J2n=i /") < 2“'".) 

This exercise relates our development to axiomatic measure theory, 
which is based on primitive notions of a “measurable subset” of a set 
X and the “measure” of such a set, and in which the integral is often 
built up in the following way. First, define a function / : X — >■ R 
to be measurable if |/ < a] is a measurable set for each a G R. 
Next, define the integral of a simple function X)^=iCnXA„) where 
the measurable sets An are pairwise-disjoint, to be 
If / is a nonnegative measurable function, then define its integral to 
be the supremum of the integrals of simple functions s which satisfy 
0 < s < / on the complement of a set whose measure is 0. For this 
approach to integration see, for example, [43] or [44]. 

There is another definition of measurability for sets, due to Caratheodory: 
we call a set A C R C-measurable if 

for each compact interval I . We prove two lemmas that enable us to show 
that this notion of measurability is equivalent to our original one. 

(2.3.7) Lemma. Let f be an integrable function. Then there exists a se- 
quence (fn) of step functions converging almost everywhere to f. Moreover, 
if f vanishes outside a compact interval /, then 

(i) fn can be chosen to vanish outside /; and 

(ii) if in addition, f is the characteristic function of an integrable set, 
then fn can be taken as the characteristic function of a finite union 
of subintervals of I. 

Proof. Since / = lim„_>oo fX[-n,n], h suffices to consider the case where 
/ vanishes outside a compact interval I = [a, b]. For each n let 



^ — ^n,0 ^ ^n,l ^ * * * ^ X^n.2'^ — b 
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be a partition of [a, b] into 2 ” subintervals of equal length. Define a step 
function /„ by setting 



fn{x) 



and let 



/ Xn,j + 1 

f if Xn^j ^ X 

0 otherwise, 



OO 

if = [J {xn,j : 0 < j < 2"} U {a; G R : f{x) is undefined} . 

n=l 



Then E has measure zero, and, by Exercise (2.2.4: 9), lim„_>oo fn{x) = f{x) 
for all X in E. This completes the proof of (i). 

Now suppose that / is the characteristic function of an integrable set 
Adi, and, using the first part of the proof, construct a sequence of 
step functions that vanish outside / and converge almost everywhere to /. 
Define 

r 1 if i 

/n= < 

[ 0 otherwise. 

Then /„ is the characteristic function of a finite union of subintervals of I, 
and / = lim„_>oo fn almost everywhere. □ 



(2.3.8) Lemma. Let A he ciTi tfiteyvable subset of a coT/ipeict iTitevval I . 
Then there exists a decreasing sequence (x„) of integrable functions con- 
verging almost everywhere to xa, such that each Xn is the characteristic 
function of a countable union of pairwise-disjoint bounded open intervals. 



Proof. Using Lemma (2.3.7), choose a sequence (/„) of step functions 
that vanish outside I and converge almost everywhere to xa, such that 
each fn is the characteristic function of a finite union of pairwise-disjoint 
bounded open intervals. Then XA is also the limit, almost everywhere, of 
the decreasing sequence ((?«), where 

g„ = sup/fe. 

fe>n 

Also, gn is the characteristic function of which is a countable 

union of bounded open subintervals of I . We now build up a sequence 
{Tk)’k=n finite collections of pairwise-disjoint bounded open intervals, as 
follows: taking T„ = Sn, suppose we have constructed T^r for some N > n, 
and form T^+i by adjoining to Tn all the intervals of the form J\ IJ^n 
with J G Sn+i- Let Xn be the characteristic function of (J^n ^^k, which is a 
countable union of pairwise-disjoint bounded open intervals. Then 
almost everywhere, so Xn converges to XA almost everywhere. □ 
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(2.3.9) Proposition. Let A he a subset o/R. Then 

(i) A is measurable if and only if it is C-measurahle; 

(ii) A is integrahle if and only if it is measurable and has finite outer 
measure, in which case yi*{A) = yi{A). 



Proof. Assume, to begin with, that A is C-measurable. Given a compact 
interval / = [a, b] and e > 0, choose sequences (/„) and ( J„) of bounded 
open intervals such that 



An/C U /„, 

n—1 

oo 

I\A c U J„, 

n=l 

oo 

Y,\In\<T*{Anl) + e/2, 

n—1 

oo 

'£\Jn\<T*{I\A)+e/2. 

n—1 

By Lebesgue’s Series Theorem (Exercise (2.2.13:4)), the functions 

OO 

9 = XI - 

n—1 

oo 

n—1 

are defined almost everywhere and integrable, 

/ oo 

g=b-a-'^\Jn\, 

n^l 

and 

/ oo 
n^l 

So 

J {h — g) < fi* {An I) + fi*{I\A) — {b — a) + s = s. 



Since 



9 < XAni < h 
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almost everywhere and e > 0 is arbitrary, we see from Exercise (2.2.13: 12) 
that XAni is integrable. Moreover, 

j XAni ^ J 9 

> b-a- ii*{I\A) - I 



J XAni < J h < fj,* {A n I) + -. 



Again as £ > 0 is arbitrary, we see that f XAni = I). 

Since XA is the limit of the sequence {xAn[-n,n])^^it it follows that A is 
measurable in our original sense. If also /i*(A) is finite, then 

lim fj.* (An [—n,n]) = fi* (A), 

n—^oo 

by Exercise (2.1.1:10); so, applying Beppo Levi’s Theorem (2.2.12), we 
conclude that A is integrable, with fi{A) = /i*(A). On the other hand, if 
A is integrable, then Lebesgue’s Dominated Convergence Theorem (2.2.14) 
shows that 

lim ii*{Ar\ [-n,n\) = / xa- 

n—¥oo J 

It then follows from Exercise (2.1.1: 10) that /r*(A) = fJ,{A). 

It remains to prove that measurability implies C-measurability. Accord- 
ingly, let A be measurable in our original sense, and again let / = [a, b] 
be any compact interval. Using Lemma (2.3.8), construct a decreasing se- 
quence (xn) of integrable functions converging almost everywhere to XAnij 
such that each is the characteristic function of the union of a sequence 
{In,k)'^i of pairwise-disjoint bounded open intervals. Then (J^i 
eludes {A n I)\E, where if is a (possibly empty) set of measure zero; 



OO ^ 

fJ,*{A n /) < ^ |/„XI = J Xn, 



the last equality being a consequence of Beppo Levi’s Theorem (2.2.12). 
By Lebesgue’s Dominated Convergence Theorem, we now have 



Similarly, 



/r(An/)= lim [ Xn > (A C\ I) . 

n—^oo J 



b — a — /i(A n i) = n{I\A) > p*{I\A), 



^i*{AcM)+^i*{I\A)<b-a. 
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But Exercise (2.1.1: 6) shows that fl /) + fj,*{I\A) > b — a; so 
fi*{An I) + fi*{I\A) = b - a, 
and therefore A is measurable. □ 

(2.3.10) Exercise 

Let / be a compact interval, and / an integrable function that van- 
ishes outside I. Prove that there exists a sequence (/„) of continuous 
functions, each vanishing outside I, such that J \ f — fn\ 0- (Re- 
duce to the case where / is bounded. Then use Lemma (2.3.7) to 
reduce to the case where / is a step function.) 

Are all subsets of R measurable? No: the Axiom of Choice (Appendix B) 
ensures that nonmeasurable sets exist.® To show this, following Zermelo, 
we define an equivalence relation ~ on [0, 1) by 

a; ~ y if and only ii x — y € Q. 

Let X denote the equivalence class of x under this relation. By the Axiom 
of Choice, there exists a function (j) on the set of these equivalence classes 
such that 

4>{x) G X {x G [0, 1)). 

Let 

E= {(j}{x) : X G [0,1)}. 

Now let ri, r 2 , . . . be a one-one enumeration of Q D [0, 1), and for each n 
define 



An = En [0,r„), 

Bn = ED [r„, 1), 

= { 3 ^ G [0, 1) : a; -|- r„ — 1 G A„} , 

E^^ = {x G [0, 1) :x + rnG B„} , 

En = U 

We show that if Vn < r^, then the sets Em,En are disjoint. To this end, 
first note that 



Ek = {x G [0, 1) : a: -I- Tfc - [a; -I- rfej G E} , 



®Solovay [48] has shown that there is a model of Zermelo-Fraenkel set theory, 
without the Axiom of Choice, in which every subset of R is Lebesgue measurable. 
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where [a;J denotes the integer part of the real number x. Suppose that 
X G Em n En, so that 



ym = X + Tm - \x + Tm\ & E 



and 

Vn = X + rn - [x + r„\ G E. 

Then 

ym-yn = rm-r„+ integer 

is a rational number. Since E contains exactly one element from each equiv- 
alence class under we must have ym = yn] so Xm — Xn is an integer, which 
is impossible as 0 < r„ < < 1- Hence, in fact, Em H En is empty. 

Now suppose that E is measurable; then are measurable and have 

finite measure. Since E^ and En are translates of and respectively, it 
follows from Exercise (2.2.9: 9) that is measurable, with y{En) = y{E). 
But so by Exercise (2.3.4: 2), ^2^^=! K^n) = 1- This is 

absurd, since an infinite series with all terms equal cannot converge unless 
all its terms are 0. Hence E is not measurable. 

For more on nonmeasurable sets, see Chapter 5 of [33]. 

(2.3.11) Exercises 

.1 Let if be a nonmeasurable subset of R, and A a subset of E that is 
measurable. Prove that y*{A) = 0. 

.2 Give an example of a nonmeasurable function / such that j/j is 
integrable. 

At first sight it might appear that our approach to the Lebesgue integral 
cannot be generalised to multiple integrals. However, in the context of R" 
it is relatively straightforward to develop notions of outer measure, set of 
measure zero, and Dini derivates (of a special, set-based kind), and it is not 
too hard to prove a version of the Vitali Covering Theorem and hence of 
Fubini’s Series Theorem ([46], Chapter 4). With these at hand, as Riesz has 
pointed out,® it is indeed possible to develop the Lebesgue integral in R” by 



®// ne s’agira, dans le present Memoire, que les fonctions d’une seule variable 
et il pourrait paraitre, a premiere vue, comme si notre methode etait fagonnee 
entierement sur ce cas particulier. Dans cet ordre d’idees, il convient d’ observer 
que Von auraitpu baser les considerations, au lieu de la derivee au sens ordinaire, 
sur I’idee moins exigeante de derivee par rapport a un reseau, comme s’en sert M. 
de la Vallee Poussin pour I’etude de la derivation des fonctions d’ensemble [53]. 
Non seulement que la demonstration de V existence presque partout de cette sorte 
de derivee d’une fonction monotone est presque immediate, mais en outre on 
ne rencontre aucune nouvelle difpculte quand on veut passer au cas de plusieurs 
variables et les considerations concernant Vintegrale s’etendent a ce cas general 
avec des modifications evidentes. ([39], pages 192-193) 
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methods akin to those we have used for one-dimensional integration. But 
as there are more illuminating approaches to integration on R”, especially 
once a general theory of measures has been developed (see [44] or [43]), we 
do not discuss the theory of multivariate integrals in this book. 




Part II 



Abstract Analysis 




3 

Analysis in Metric Spaces 



...an excellent play; well digested in the scenes, 
set down with as much modesty as cunning. 

HAMLET, Act 2, Scene 2 

In Section 1 we abstract many of the ideas from Chapter 1 to the context of a 
metric space, a set in which we can measure the distance between two points. In 
Section 2 we discuss limits and continuity in that context. Section 3 deals with 
compactness, which, as a substitute for finiteness, is perhaps the single most 
useful concept in analysis. The next section covers connectedness and lifts the 
Intermediate Value Theorem into its proper context. Finally, in Section 5, we 
study the product of a family of metric spaces, thereby enabling us to deal with 
analysis in R" and C". 



3.1 Metric and Topological Spaces 

The notion of a metric space generalises the properties of R that are asso- 
ciated with the distance given by the function {x,y) i-a- \x — y\. A further 
generalisation, which we touch on at the end of this section, is a topological 
space, in which, since there may be no analogue of distance, the concept of 
open set plays a primary role. 

A metric, or distance function, on a set AT is a mapping p of X x X into 
R such that the following properties hold for all x, y, z in X . 



Ml p{x, y) > 0. 
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M2 p{x, y) = 0 if and only \i x = y. 

M3 p{x,y) = p{y,x). 

M4 p{x,y) < p{x,z) + p{z,y) {triangle inequality). 

A metric space is a pair {X, p) consisting of a set X and a metric p on X; 
when the identity of the metric is clear from the context, we simply refer 
to X itself as a metric space. We use the letter p to denote the metric on 
any metric space, except where it might be confusing to do so. 

The standard example of a metric space is, of course, the real line R 
taken with the metric {x,y) i— >■ \x — y\. More generally, if S' is a subset of 
R, then the restriction of this metric to a function on S x S is a metric 
on S. Unless we say otherwise, whenever we consider S C R as a metric 
space, we assume that it carries this canonical metric. 

(3.1.1) Exercises 

.1 Let xi, . . . ,Xn be elements of a metric space X. Prove the generalised 
triangle inequality. 

p{xi,Xn) < p{xi,X2) + p{x2,Xz) H h p{Xn-l,Xn). 

.2 Let AT be a set. Prove that the mapping p : AT x X — >• R, defined by 

r 0 li X = y 

p{x,y) = < 

[ 1 a x^y 

is a metric on X. 

This metric is called the discrete metric, and X, taken with the 
discrete metric, is called a discrete space. 

.3 Prove that each of the following mappings from R” x R" to R is a 
metric on R". 

(i) {x,y) ^^^i\xi — yi\ {taxicab metric). 

(ii) {x, y) max{|a;i - yi\ '. I < i < n}. 

Here, and in the next two exercises, x = {xi, . . . ,Xn) and y = 
{yi,---,yn)- 

.4 Prove the Cauchy-Schwarz inequality, 

n / ^ \ / n \ 

^Xiy,<\^xfj ■ 

Hence prove Minkowski^ s inequality, 

/ n \ / n \ / n \ 
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.5 Show that the mapping 



{x,y) i-T 



\ 






i=l 



is a metric on R”. (Note the preceding exercise.) 

This metric is known as the Euclidean metric, and R”, taken with the 
Euclidean metric, is known as Euclidean n-space or n-dimensional 
Euclidean space. 

.6 Let p be a prime number. For each positive integer n define Vp(n) to 
be the exponent of p in the prime factorisation of n. For each rational 
number r = Em/n, where m, n are positive integers, define 



Vp (f) = fp(w) - Vp{n). 

Show that this definition does not depend on the particular represen- 
tation of r as a quotient of integers, and that if r' is also rational, 
then 

Vp{rr') = Vp{r) + Vp{r'). 

Finally, show that 



( p-M==-v) iix^y 
p{x,y) = < 

[ 0 if X = y 

defines a metric p — which we call the p-adic metric — on Q, such 
that 

p{x,z) < max{p(a;, 2 /), p{y,z)}. 

On any set X a metric p that satisfies this last property is called an 
ultrametric, and {X,p) is called an ultrametric space; clearly, X is 
then a metric space. 

.7 Let X be a nonempty set, and denote by B{X, R) the set of all 
bounded mappings of X into R. Show that 

p{f,g) =sup{\f{x) - g{x)\ : x G X} 

defines a metric on B{X, R). 

From now on, when we refer to B{X,I{.) as a metric space, it is 
understood that the metric is the one defined in this exercise. 

.8 A pseudometric on a set A is a mapping p : A x A — >■ R that 
satisfies Ml, M3, M4 and the following weakening of M2: ii x = y, 
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then p{x,y) = 0. The pair {X,p), or, loosely, X itself, is then called 
a pseudometric space. Prove that in that case, 

cc ~ y if and only if p{x, y) = 0 

defines an equivalence relation on X, and that 

p{x,y) = p{x,y) 

defines a metric p on the quotient set X/ where x is the 
corresponding equivalence class of x. 

In practice, we often identify Xj ^ with X, and thereby turn X into 
a metric space, by calling two elements x,y of X equal if p{x,y) = 0 
or, equivalently, if x = y. 

.9 Prove that 

p{f,g) = [ \f-g\ 

J a 

defines a metric on the set of continuous real-valued mappings on the 
compact interval [a, b] , where denotes the Riemann integral. 

.10 Prove that 

p{f,g) = J \f -g\ 

defines a pseudometric on the set of Lebesgue integrable functions on 

R. 

The corresponding metric space (see Exercise (3.1.1: 8)) is denoted by 
Li(R). We see from Exercise (2.2.4: 6) that two elements of Ti(R) are 
equal if and only if, as functions, they are equal almost everywhere. 

Let X and Y be metric spaces. A bijection f of X onto Y is called an 
isometry if 

P{f{x),f{y)) = p{x,y) 

for all x,y in X, in which case the inverse mapping f~^ is an isometry of 
Y onto X, and the spaces X and Y are said to be isometric (under /). 
Two isometric spaces can be regarded as indistinguishable for all practical 
purposes that involve only distance. 

Now let X be a metric space, and Y a set in one-one correspondence 
with X. With any bijection f of X onto Y there is associated a natural 
metric py on Y, defined by setting 

PY{f{x)J{y)) = p{x,y). 

We say that the metric p has been transported from AT to P by /. The 
mapping / is then an isometry from (X,p) onto {Y,py). 
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An important example of the transport of a metric occurs in connection 
with the real line R, and enables us, in Section 3.2, to discuss the conver- 
gence of sequences in a metric space as a special case of the convergence of 
functions. The mapping / defined on R by 



fix) 



X 

1 -h lsl 



{x G R) 



is an order-preserving bijection of R onto the open interval (—1, 1), with 
inverse mapping g defined by 

9iy') = Y^\ (lyl<i)- 



Let R be obtained from R by adjoining two new elements — oo and oo, 
called the points at infinity. (Note that — oo and oo are not real numbers, 
and that the real numbers are often referred to as the finite elements of 
R.) Extend / to a bijection of R onto [—1,1] by setting 



/(— oo) = —1 and /(oo) = 1. 

Then g extends to a bijection of [—1,1] onto R, such that the extended 
mapping g is the inverse of the extended mapping /. Now transport (by g) 
the standard metric (s,t) !->■ \s — t\ from [—1, 1] to R; that is, define 

P-^{x,y) = \f{x) - f{y)\ (x,j/gR). 

Taken with the metric p-p , the set R becomes a metric space, called the 
extended real line. Note that restricted to R, is different from the 
standard metric {x,y) i— \x — y\ on R. 

We introduce the order relations >, > on R (and hence the opposite 
relations <, <) by setting 

x>y if and only if f{x) > f{y), 

X > y if and only if f(x) > f{y). 

On R these relations coincide with the respective standard inequality 
relations. 



(3.1.2) Exercises 

.1 Prove that the function p^ is a metric on R. 

.2 Show that the relations > and > on R have the properties that you 
would expect. In particular, prove that 



(i) — oo < X < GO for all x G R; 
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(ii) a nonempty subset S' of R is bounded, and has a supremum and 
infimum, relative to the order > on R (where sup S and inf S 
may equal oo or — oo); 

(iii) when restricted to R, the order relations > and > on R coincide 
with the standard order relations > and > . 

Let {X, p) be a metric space, a G X, and r > 0. We define the open ball 
with centre a and radius r to be 

B{a,r) = {x G X : p(a,x) < r}, 

and the closed ball with center a and radius r to be 

B{a,r) = {x G X \ p{a,x) < r}. 

For example, the open and closed balls with centre a and radius r in R are 
the intervals (a — r, a + r) and [a — r,a + r], respectively; and the open ball 
with centre oo and radius r G (0, 1) in R is {r~^ — 1, oo) U {oo} . 

In order to define the notions of open set, interior point, interior of a 
set, neighbourhood, cluster point, closure, and closed set for a metric space 
X , in the corresponding definition for subsets of R we replace 

— the open interval {x — r,x + r) by its analogue, the open ball B{x, r) 
in X, and 

— the inequality \x — y\ < r by the inequality p{x, y) < r. 

For example, a subset A of X is said to be open (in X) if for each x G A 
there exists r > 0 such that B{x,r) C A. 

Propositions (1.3.2), (1.3.9), and (1.3.10), and the applicable parts of 
Exercises (1.3.7) and (1.3.8), carry over unchanged into the context of a 
metric space. When we mention those results in future, it is assumed that 
we are referring to their metric space versions. 

(3.1.3) Exercises 

.1 Prove that X itself, the empty set 0 C X, and the open balls in X are 
open sets; and that X, 0, and the closed balls in X are closed sets. 

.2 Give proofs of the metric space analogues of Proposition (1.3.2), 
Exercises (1.3.7: 3-8), and Exercises (1.3.8: 3-8). 

.3 Prove that a subset of X is closed if and only if X\S is open (cf. 
Proposition (1.3.9)). 

.4 Prove that the intersection of a family of closed sets is closed, and 
that the union of a finite family of closed sets is closed (cf. Proposition 
(1.3.10)). 
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.5 Suppose that p is an ultrametric on X (see Exercise (3.1.1: 6)). Prove 
the following statements. 

(i) If p{x,y) ^ p{y,z), then p{x,z) = max{p(a;,y), p{y,z)}. 

(ii) If y G B{x,r), then B{y,r) = B{x,r). 

(iii) Every open ball in is a closed set. 

(iv) If two open balls in X have a nonempty intersection, then one 
of them is a subset of the other. 

Is every closed ball in X an open set? Does (iv) hold with “open ball” 
replaced by “ball”? 

.6 Two metrics on a set are said to be equivalent if they give rise to the 
same class of open sets. Prove that the Euclidean metric is equivalent 
to each of the metrics in Exercise (3.1.1: 3). 

.7 Prove that oo is a cluster point of R, considered as a subset of the 
metric space R. 

If S' is a subset of a metric space X, then the restriction to S x S of the 
metric p on X is a metric — also denoted by p — on S, and is said to be 
induced on S by p. The set S, taken with that induced metric, is called a 
(metric) subspace of X. 

(3.1.4) Exercise 

Prove that if a; G S and r > 0, then S fl B{x,r) is the open ball, 
and S n B{x,r) is the closed ball, with centre x and radius r in the 
subspace S. 

(3.1.5) Proposition. Let S he a subspace of the metric space (X,p), and 
A a subset of S. Then A is open in S if and only if A = SHE for some 
open set E in X; and A is closed in S if and only if A = S D E for some 
closed set E in X 

Proof. We prove only the part dealing with open sets, since the other 
part then follows by considering complements. Accordingly, suppose that 
A = S E for some open set if in A, and let cc G A. Choosing r > 0 such 
that B{x,r) C E, we see that 

a; G S' n B{x, r) C S C\ E. 

Since, by Exercise (3.1.4), S fl B{x,r) is the open ball with centre x and 
radius r in S, it follows that x is an interior point of S fl if in the subspace 
S. Hence A = S fl if is open in S. 
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Conversely, suppose that A is open in S. Then, by Exercise (3.1.4), for 
each X G A there exists > 0 such that S fl B{x, Xx) C A. So 

A= U (-S' n B{x, Xx)) = S' n [J B{x, Xx), 

x£A x^A 

where the set Ua;eA ^(a;, r^,) is open in X, by (the metric space analogue 
of) Proposition (1.3.2). □ 

(3.1.6) Exercises 

In each of these exercises S' is a subspace of {X,p). 

.1 Complete the proof of Proposition (3.1.5). 

.2 Prove that the following conditions are equivalent. 

(i) Every subset of S that is open in S is open in X. 

(ii) S is open in X. 

.3 Prove that the following conditions are equivalent. 

(i) Every subset of S that is closed in S is closed in X. 

(ii) S is closed in X. 

.4 Let x G S and U C S. Show that 17 is a neighbourhood of a; in S if 
and only if 17 = S fl P for some neighbourhood V of a; in 77. 

.5 Let x G S. Show that the following conditions are equivalent. 

(i) Every neighbourhood of x in S is a neighbourhood of x in X. 

(ii) S is a neighbourhood of x in X. 

Let A and B be subsets of X. We say that A is 

• dense with xespect to B if B C A, and 

• dense in X, or evexywhexe dense, if A = X. 

The space X is called sepaxable if it contains a countable dense subset. 
For example, Q and R\Q are dense in R, by Exercises (1.1.1: 19) and 
(1.2.11: 5). Thus R is separable, as Q is countable. 

(3.1.7) Proposition. If A is dense with xespect to B, and B is dense 
with xespect to C, then A is dense with xespect to C. 

Proof. We have B C A and C C B. By Exercises (1.3.8: 7 and 3), 
B c {A) = A] whence C <G A. □ 

(3.1.8) Exercises 

.1 Show that A is dense in X if and only if each nonempty open set in 
X contains a point of A. 
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.2 Prove that (— oo, — 1) U (—1, 1) U (1, oo) is dense in R. 

.3 Prove that a nonempty subspace S' of a separable metric space X 
is separable. (Let (x„) be a dense sequence in X. For each positive 
integer m consider the set {n : p{xn, S) < 1/m} .) 

.4 Prove that the union of a countable family of separable subspaces of 
X is separable. What about the union of an uncountable family of 
separable subspaces? 

.5 A point a; of a metric space is said to be isolated if there exists r > 0 
such that B{x,r) = {x| . Prove that the set of isolated points of a 
separable metric space is either empty or countable. 

.6 Prove that a nonempty family of pairwise-disjoint, nonempty open 
subsets of a separable metric space is countable. (Use the preceding 
exercise.) 

If S' is a nonempty subset of X and x G X , then we define the distance 
from X to S to be the real number 

p{x, S) = inf{p(x, s) : s G S|. 

More generally, if T is also a nonempty subset of A, then we define 
p(S, T) = inf {p(s, t) : s G S, t G T} . 

(3.1.9) Proposition. If S is a nonempty subset of X, and x,y are two 
points of X , then 

|p(x,S) - p{y,S)\ < p(x,y). 

Proof. For each s G S we have 

p{x, S) < p{x, s) < p(x, y) + p{y, s). 



It follows that 

p(x, S) < p{x, y) + inf{p(y, s) : s G S| = p{x, y) + p{y, S) 
and therefore that 



Similarly, 



p{x,S) - p{y,S) < p{x,y). 



p{y,S) - p{x,S) < p{x,y). 

The result follows immediately. □ 

The diameter of a nonempty subset S of a metric space X is defined as 
diam(S) = sup{p(a;,7/) : x G S, y G S} 




134 



3. Analysis in Metric Spaces 



and is either a nonnegative real number or oo. Clearly, if S' C T, then 
diam(S) < diam(T); and diam(S) = 0 if and only if S contains exactly one 
point. 

A subset S of X is said to be bounded if its diameter is finite — that is, if 
diam(S) € R. 

(3.1.10) Exercises 

.1 For nonempty subsets S, T of X, prove that p(S, S) = 0 and p(S, T) = 
p{T,S). 

.2 Is it true that if S,T are closed subsets of R such that p{S,T) = 0, 
then S n T is nonempty? 

.3 Prove that a nonempty subset S of X is closed if and only if p{x, S) > 
0 for each x G X\S. 

.4 Let X be an ultrametric space, and B, B' distinct open balls of ra- 
dius r in X both of which are contained in a closed ball of radius r. 
Compute p{B,B'). (Note Exercise (3.1.3: 5).) 

.5 Is it true that 

diam(i?(a;, r)) = diam(B(a:, r)) = 2r 
for any metric space X, x G X, and r > 0? 

.6 Prove that 

(i) the union of two bounded subsets of a metric space is bounded; 

(ii) the union of finitely many bounded subsets of a metric space is 
bounded. 

Is the union of an infinite family of bounded subsets necessarily 
bounded? 

Although the notion of a metric space is sufficiently strong to underpin 
a large amount of analysis, the following more general notion is needed in 
more advanced work.^ 

A topological space (X, r) consists of a set X and a family r of subsets 
of X satisfying the following conditions. 

TOl X G T and 0 G t. 

T02 If Ai G r for each i G I, then (Jie/ ^ 



^As we do not use the notion of a topology, other than a metric one, in the 
remainder of this book, this part of the section can be skipped without penalty. 
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T03 If € T and A2 € r, then fi ^2 G t. 

T is called the topology of the space, and the elements of r the open sets 
of that topology. When the topology r is clear from the context, we speak 
loosely of X as a topological space and of the elements of r as open sets in 

X. 

A metric space {X,p) is associated with a topological space (AT, r) in 
the obvious way: the open sets of r are precisely those subsets of X that 
are open relative to the metric p. In such a case we say that the metric p 
defines the topology r, and we identify the metric space (A, p) with the 
associated topological space (A, r). 

A topological space (A, t) is said to be metrisable if there is a metric p 
on A that defines the topology r. Not every topological space is metris- 
able. For example, if A = {0, 1} is given the topology r consisting of 0 and 
A itself, then every neighbourhood of 0 intersects every neighbourhood of 
1; if T were metrisable, then the distinct points 0, 1 of A would have dis- 
joint neighbourhoods — namely, B{0, and B{1, |). For characterisations 
of metrisable topological spaces see [25]. 

Let S' be a subset of a topological space A, and x £ X. We say that x is 
an interior point of S if there is an open set A such that x £ A C S; and 
we define the interior of S to be the set of all interior points of S. By a 
neighbourhood of x we mean a set U G X containing x in its interior. On 
the other hand, x is called a cluster point of S if each neighbourhood of 
X has a nonempty intersection with S; and we define the closure of S (in 
A) to be the set S of all cluster points of S. A subset C of A is said to be 
closed (in A) if it equals its closure. 

Propositions (1.3.2), (1.3.9), and (1.3.10), and the applicable parts of 
Exercises (1.3.7) and (1.3.8), all hold in the context of a topological space. 

(3.1.11) Exercises 

.1 Prove that the standard metric on R, and the metric induced on R 
as a subset of the extended real line R, give rise to the same topology 

on R. 

.2 Prove the statement immediately preceding this set of exercises. 

3.2 Continuity, Convergence, and Completeness 

In contrast to our approach to limits in Chapter 1, in the context of a metric 
space we first introduce the notion of continuity. The following definition is 
intended to capture formally the idea that f{x) is close to /(a) whenever 
X is close to a. 

Let A, Y be metric spaces, and / a mapping of A into Y. We say that 

/ is 
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• continuous at the point a € X if for each £ > 0 there exists 5 > 0 
such that p{f{a),f{x)) < e whenever x € X and p{a,x) < S; 

• continuous on X , or simply continuous, if it is continuous at each 
point of X. 

If / is not continuous at a € X, we say that / has a discontinuity at a, or 
that / is discontinuous at a. 

(3.2.1) Exercises 

.1 Prove that the identity mapping ix ■ X ^ X, defined on the metric 
space X by ix(x) = x, is continuous. 

.2 Prove that any constant mapping between metric spaces is continu- 
ous. 

.3 A mapping f : X ^ Y between metric spaces is said to be contractive 
if p{f{x), f{y)) < p{x, y) whenever x, y are distinct points of X. Prove 
that a contractive mapping is continuous. 

.4 Let X he a metric space, a G X, and /, g two functions from X into 
R that are continuous at a. Prove that the functions f + g, f — g, 
max{f,g}, min{/, g}, |/| , and fg are continuous at a. Prove also 
that if g{a) yf 0, then f /g is defined in a neighbourhood of a and is 
continuous at a. 

.5 Let F be a closed subset of a metric space X, and f ■ Y — >• R a 
bounded continuous mapping. Prove that 

X inf {f{y)p{x, y) -.y GY} 
is continuous on X\Y. (Note Exercise (3.1.10: 3).) 

.6 Let ft- be a mapping of R°^ into itself such that 

(i) h{t) = 0 if and only if t = 0, 

(ii) ft(s -I- t) < ft(s) -I- ft(t) for all s, t. 

Let p be a metric on a set X. Prove that d = ft o p is a metric on X, 
and that if ft is continuous at 0, then d is equivalent to p (see Exercise 
(3.1.3: 6)). Prove, conversely, that if X contains a point that is not 
isolated relative to p (see Exercise (3.1.8: 5), and if p and hop are 
equivalent metrics, then ft is continuous at 0. 

Taking h(t) = min {t, 1} in the first part of this exercise, we obtain a 
bounded metric equivalent to the given metric on X. 
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(3.2.2) Proposition. The following are equivalent conditions on a 
mapping f : X ^ Y, where X, Y are metric spaces. 

(i) / is continuous. 

(ii) For each open set A CY, is open in X. 

(iii) For each closed set A cY, f~^{A) is closed in X. 

Proof. Suppose that / is continuous, lei A d Y be an open set, and 
consider any a in f~^{A). Since /(a) € A and A is open, there exists e > 0 
such that B{f{a),e) C A. Choose <5 > 0 such that if p{a,x) < S, then 
p{f{a),f{x)) < e and therefore f{x) G A. Then B{a,S) C f~^{A). Hence 
f~^{A) is open in X, and therefore (i) implies (ii). 

Since a set is open if and only if its complement is closed, it readily 
follows that (ii) is equivalent to (iii). Finally, assume (ii), let a G X and 
£ > 0, and set A = B{f{a),e) C Y. Then A is open in Y, so f~^{A) is open 
in X. Since a G f~^{A), there exists <5 > 0 such that B{a, 6) C f~^{A); so 
if p{a,x) < 6, then f{x) G A and therefore p{f{a),f{x)) < s. Hence / is 
continuous at a. Since a G X is arbitrary, / is continuous on X. Thus (ii) 
implies (i). □ 

The preceding result says that a mapping between metric spaces is con- 
tinuous if and only if the inverse image of each open set is open. But the 
image of an open set under a continuous mapping need not be open: the 
continuous function cc i— >■ 0 maps each nonempty open subset of R onto 
the closed set {0} . Likewise, although the inverse image of a closed set 
under a continuous mapping is closed, the image of a closed set need not 
be: the mapping (x, y) x on the Euclidean space takes the hyperbola 
{(a;, y) : xy = 1} , a, closed set, onto the open set R\ {0} . 

(3.2.3) Proposition. Let X,Y,Z be metric spaces. If f : X ^ Y is 
continuous at a G X, and g '. Y -G Z is continuous at f{a), then the 
composite mapping go f : X ^ Z is continuous at a. If f is continuous on 
X and g is continuous on Y , then g o f is continuous on X. 

Proof. Suppose that / is continuous at a and that g is continuous at 
b = f{a). Let £ > 0. The continuity of g at 6 ensures that there exists 5' > 0 
such that if p{b,y) < S', then p{g{b),g{y)) < e. In turn, as / is continuous 
at a, there exists (5 > 0 such that if p{a, x) < S, then p{f{a),f{x)) < S'. So 
if p{a,x) < S, then p{b,f{x)) < S' and therefore p{g{b),g{f{x))) < £; that 
is, 

p{g° f{a), go f{x)) < £. 

Hence g o f is continuous at a. 

The second conclusion of the proposition follows immediately from the 
first. □ 
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Let S' be a subset of a metric space X, and a a limit point of S — that 
is, a point of the closure of S\ {a} . Let / be a mapping of S\ {a} into a 
metric space Y, and I a point of Y. We say that f{x) has a limit I as x 
tends to a in S if the mapping F : S U {a} — >■ Y defined by 

f f{x) Yx G S\ {a} 

F{x) = I 

I Z if a; = a 



is continuous at a relative to the subspace S U {a} of X. We then also use 
such expressions a,s I is a limit of the mapping f at a with respect to S, or 
f{x) converges to I as x tends to a in S, or f{x) tends to I as x G S tends 
to a. In that case we write 



lim f{x) 

~^a, x^S 



or 

f{x) -G-lasx^a, xGS, 

Note that in this definition it is not required either that a G S or that f{x) 
be defined at a; = a. 

In the special case where S = X we often write limj,-^^ f{x), rather than 

lim2,_>o^ xexf(x). 



(3.2.4) Exercises 

.1 Prove that the following condition is both necessary and sufficient 
for Z G F to be a limit of f{x) as x G S tends to a: for each £ > 0 
there exists 6 > 0 such that if a; G S' and 0 < p{a,x) < 6, then 
p(Z,/(a;)) < e. 

.2 Prove that a mapping / has at most one limit at a G (S\{a}) with 
respect to the subset S of X. (Thus we are safe in referring to “the” 
limit of / at a.) 

.3 Let a G X he a limit point of X. Prove that f : X ^ Y is continuous 
at a if and only if /(a) = lima,_>a, x ex fix). 

.4 Show that if I = lima,_>a_ f{x), then for each subset A of S such 
that a G A\{a}, Z is the limit of / at a with respect to A. 

.5 Show that if Z = lim^^a.xex f(x) and the mapping g : Y Z is 
continuous at Z, then g(l) = lima,_>a, ,xexgif{x)). 

.6 Prove that if Z = lim 2 ,_>o^ fix), then Z G /(S). 
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Using the metric on the extended real line R introduced in Section 1 of 
this chapter, we can handle the convergence of sequences in a metric space 
X as a special case of the convergence of functions. To this end, recall that 
a sequence ^ is really a mapping n i— of N'*' into X, and 

note that oo is a limit point of N’*' in R (see Exercise (3.1.3: 7)). If the 
mapping n i— >■ has a limit I at the point oo G R with respect to , we 

call I the limit of the sequence (x„), we say that the sequence (xn) converges 
to I as n tends to oo, and we write 

I = lim Xn 

n—^oo 

or 

^ as n — >■ oo. 

The next proposition shows, in particular, that on R our current notion 
of convergence of sequences coincides with the one introduced in Section 

1.2. 

(3.2.5) Proposition. In order that I = lim„_>ooX„, it is necessary and 
sufficient that for each £ > 0 there exist a positive integer N such that 
p{l,Xn) < s whenever n> N. 

Proof. By Exercise (3.2.4: 1), in order that a = lim„_>oo a;„, it is necessary 
and sufficient that for each £ > 0 there exist d > 0 such that if n G N”*" 
and 0 < p^(oo,n) < <5, then p{a,Xn) < £■ But 

p^(oo, n) = (n + 1)“^ > 0 , 

so p^(oo,n) < (5 if and only if n > fV, where N is the smallest positive 
integer > — 1. The desired conclusion now follows. □ 

In view of Proposition (3.2.5), we can easily adapt to the context of a 
metric space many of the elementary results about limits of sequences that 
were proved in the context of R in Chapter 1. We frequently do this without 
further comment. 

(3.2.6) Proposition. Let S he a subset of the metric space X , and a G 
X. In order that a € S, it is necessary and sufficient that a be the limit of 
a sequence of points of S. 

Proof. To prove the necessity of the stated condition, assume that a G S. 
Then for each positive integer n there exists a point a;„ in S' fl B{a, n~^). 
Since p{xn,a) < 1/N whenever n> N, the sequence (a;„) converges to a. 
The sufficiency part of the proposition is left as an exercise. □ 

(3.2.7) Proposition. Let (x„) he a sequence in X, and a G X. In order 
that there exist a subsequence of (xn) converging to a, it is necessary and 
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sufficient that for each neighbourhood U of a, Xn € U for infinitely many 
values of n. 

Proof. The condition is clearly necessary. Conversely, if it is satisfied, 
then we can construct, inductively, a strictly increasing sequence (rife) 
of positive integers such that € B{a,k~^) for each k. Since € 
B{a,k~^) whenever j > k, the subsequence {xn^f) of (x„) converges to the 
limit a. □ 

(3.2.8) Exercises 

.1 Prove the sufficiency of the condition in Proposition (3.2.6). 

.2 Prove that the subset A is dense in the metric space X if and only 
if for each x € X there exists a sequence (a;„) of points of A that 
converges to X. 

.3 Let A be a dense subset of X, and let /, g be continuous functions 
from X into a metric space Y such that f{x) = g{x) for all x in A. 
Prove that f{x) = g{x) for all x in X. 

.4 Let / be a mapping between metric spaces X and Y, and let a € 
X. Prove that / is continuous at a if and only if it is sequentially 
continuous at a, in the sense that /(x„) — >■ /(a) whenever (xn) is a 
sequence in X that converges to a. 

.5 Let X be a separable metric space, and / a mapping of X into R. 
For each pair of rational numbers q, q' let X^y be the set of t G X 
such that f{x) exists and 

f{t)<q<q'< lim /(a;). 

x—¥t, x^X 

Show that Xqy is either empty or countable. (Use Exercise (3.1.8: 5).) 
Hence prove that the set of points t G X such that fix) 

exists and does not equal f{t) is empty or countable. 

A sequence (x„) in a metric space X is called a Cauchy sequence if 
for each e > 0 there exists a positive integer N such that p{xm,Xn) < £ 
whenever m,n > N. Any convergent sequence is a Cauchy sequence: for if 
(xn) converges to a limit I, then, given £ > 0 and choosing N such that 
p{xn,l) < £/2 for all n > N, we use the triangle inequality to show that 
p{xm, Xn) < £ whenever m,n> N. 

We say that X is complete if each Cauchy sequence in X has a limit in 
X. We have already seen that R is complete (Theorem (1.2.10)). 

(3.2.9) Proposition. A complete subspace of a metric space is closed. 
A closed subspace of a complete metric space is complete. 
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Proof. Let S' be a subspace of the metric space X. li x & S, then by 
Proposition (3.2.6), there exists a sequence (a:„) in S that converges to x. 
Being convergent, (a:„) is a Cauchy sequence in S. So if S is complete, then 
{xn) converges to a limit s in S. By Exercise (3.2.4: 2), we then have x = s, 
so X £ S. Hence S = S — that is, S is closed in X. 

Conversely, suppose that X is complete and S is closed in X. If (x„) is a 
Cauchy sequence in S, then it converges to a limit x in X. By Proposition 
(3.2.6), X £ S = S. Hence S is complete. □ 

(3.2.10) Exercises 

.1 Prove that a sequence (cc„) in an ultrametric space is a Cauchy se- 
quence if and only if lim„_>oo p(a;n, a;„+i) = 0. Give an example to 
show that this is not the case in a general metric space. 

.2 Show that a Cauchy sequence (x„) in X is bounded, in the sense that 
{xn : n > 1} is a bounded subset of X. 

.3 Prove that if a Cauchy sequence (x„) has a subsequence that 
converges to a limit a, then — >■ a as n — >■ oo. 

.4 Prove that the interval I = (0, 1] is not complete with respect to 
the metric p induced by the usual metric on R. Define a mapping 
p' : / X / — >■ R by 

p'{x,y) = |y - i . 

Show that p' is a metric on I, that p and p' are equivalent metrics on 
I, and that (7, p') is complete. 

.5 Let A and B be complete subsets of a metric space. Give at least two 
proofs that AiJ B and AC\ B are complete. 

.6 Suppose that p{S, T) > 0 for any two disjoint closed subsets S, T of X. 
Prove that X is complete. (Suppose there exists a Cauchy sequence 
{xn) that does not converge to a limit in X. First reduce to the case 
where Xm ^ Xn whenever m ^ n. Then consider the sets {x 2 n : n > 1} 
and {x 2 n-i : > 1} ■) 

.7 Prove that if 77 is a nonempty set, then the metric space ,B(77, R) 
is complete. (See Exercise (3.1.1: 7). Given a Cauchy sequence (/„) 
in B{X,R.) and a positive number e, first show that for each x £ X, 
{fn{x))i^^i is a Cauchy sequence in R and therefore converges to a 
limit f{x) £ R. Then prove that the function / so defined is bounded, 
and that (/„) converges to / in the metric on ,B(77, R).) 

.8 Let 77 be a metric space, a £ X, and for all x,y £ X define 



<Px{y) = p{x,y) 
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and 

i" = Wa + /:/GS(X,R)}. 

Prove that 

(i) G y, 

(ii) the equation 

d{F,G) = sup{|y(x) — G{x) \ : x G X} 
defines a metric on Y, 

(iii) cc e- is an isometric mapping of X into Y, and 

(iv) the closure X of {(j)x '■ x G X} in Y is a complete metric space. 

We call the metric space {X, d) the completion of X. More generally, 
we say that a complete metric space X' is a completion of X if there is 
an isometry of X onto a dense subspace of X'; but as two completions 
of the same metric space X are isometric (why?), we commonly refer 
to any completion of X as “the” completion of X. 

We now arrive at the notion of uniform continuity, a natural strength- 
ening of continuity that, as we show in Theorem (3.3.12), turns out to be 
equivalent to continuity for certain very important spaces. 

We say that a mapping f : X ^ Y between metric spaces is uniformly 
continuous if for each £ > 0 there exists 6 > 0 such that p{f{x),f{y)) < e 
whenever x,y € X and p{x, y) < 6. 

(3.2.11) Exercises 

.1 Prove that a uniformly continuous mapping is continuous. Give an 
example of a continuous mapping on (0, 1] that is not uniformly 
continuous. 

.2 Let /, g be uniformly continuous mappings of X into R. Show that 
f + g, f — g, and fg are uniformly continuous on X. Show that if also 
infxex \f{x)\ > 0, then 1// is uniformly continuous on X. 

.3 Let f : X ^ Y and g '■ Y Z he uniformly continuous mappings 
between metric spaces. Show that go f is uniformly continuous on X. 

.4 Let S' be a nonempty subset of X. Show that the mapping x i— >■ p{x, S) 
is uniformly continuous on X. 

.5 Let (a„) be a sequence in X. Prove that the function 

x I— inf p(x, ttn) 

n>l 



is uniformly continuous on X. 
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.6 Let a be a positive number. A mapping / between metric spaces X 
and Y is said to satisfy a Lipschitz condition of order a, or to be 
Lipschitz of order a, if 

P if{x),f{y)) < {p{x, y))“ {x, y G X). 

Prove that such a mapping is uniformly continuous. 

.7 Prove that a mapping / between metric spaces X^Y is uniformly 
continuous if and only if p{f{S),f{T)) = 0 whenever S,T C X and 
p{S,T) = 0. 

.8 Prove that if X is not complete, then there exists a uniformly 
continuous mapping of X into with infimum 0. (See Exercise 
(3.2.10:8).) 

.9 Prove that if X is not complete, then there exists an unbounded 
continuous mapping of X into R. 

.10 Suppose that every continuous mapping of X into R is uniformly 
continuous. Prove that X is complete. (Assume that A is a dense 
subset of its completion X, as defined in Exercise (3.2.10:8), and 
that there exists a Cauchy sequence of elements of X converging to 
Xao G X\X. Consider the function x l/p(x,Xca) on X.) 



(3.2.12) Proposition. Let D he a dense subset of a metric space X , and 
f a uniformly continuous mapping of D into a complete metric space Y. 
Then there exists a unique continuous mapping F of X into Y such that 
F{x) = f{x) for all x in D; moreover, F is uniformly continuous on X. 

Proof. For each £ > 0 there exists <5 > 0 such that p{f{x),f{x')) < e 
whenever p{x, x') < S. Given x in X, let (x„) be a sequence in D converg- 
ing to X. Since for each e > 0 there exists N such that p{xm,Xn) < 5, and 
therefore p{f{xm),f{xn)) < £, whenever m,n> N, we see that (/(a;„)) is 
a Cauchy sequence in Y. As the latter space is complete, (/(x„)) con- 
verges to a limit f in Y. Moreover, if (x'^f) is another sequence in D 
converging to x, then lim„_>oo f{x'„) = for, replacing (a;„) by the se- 
quence (xi,x'i,X 2 ,X 2 , ■ ■ ■) in the foregoing argument, we can show that 
(f{xi),f{x[),f{x 2 ), f{x 2 ), . . .) is a Cauchy sequence; since the subsequence 
(f{xn)) converges to we conclude from Exercise (3.2.10:3) that the se- 
quence {f{xi),f{x'i),f{x 2 ), f{x 2 ), ■ ■ ■), and hence the subsequence (/(a:(j)), 
converges to Thus 

F{x) = C = lim f{xn) 

n—^oo 

is an unambiguous definition of a function F from X into Y. If x G D, then 
{x, X, . . .) is a sequence in D converging to x, so F{x) = f{x). 
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To prove that F is uniformly continuous, consider x,x' in X such that 
p{x, x') < 5, and let (x„) and (x^) be sequences in D converging to x and 
x' , respectively. Then (/(x„)) and {f{x'^)) converge to F{x) and F{x'), 
respectively. So for all sufficiently large n we have p{F{x), f{xn)) < s, 
p{F{3:'),f{x'^)) < e, and p{xn,x'„) < S; whence p(/(a:„), /(x'„)) < £, and 
therefore, by the triangle inequality, p{F{x), F{x')) < 3e. Thus F is uni- 
formly continuous on X. Finally, the uniqueness of F is an immediate 
consequence of Exercise (3.2.8: 3). □ 

The foregoing result enables us to extend uniformly continuous functions 
from dense subsets to the whole space. We close this section with a famous 
theorem that enables us to extend continuous real-valued functions from 
closed subspaces to the whole space. 

(3.2.13) The Tietze Extension Theorem. Let X be a metric space, 
Y a closed subspace of X, and f a bounded continuous mapping of Y into 
R. Then there exists a bounded continuous mapping F : X — >■ R such that 

(i) F{y) = f{y) for all y€Y, 

(ii) F{x) = inf^^gy f{y), and 
(hi) sup,,gjf F{x) = supj^gy f{y). 



Proof. We may assume that / is not constant. Let h be an increasing 
function of the form x ax + b mapping the interval [inf /, sup /] onto 
[1, 2]; replacing / by hof, if necessary, we reduce to the case where inf / = 1 
and sup f = 2. Since Y is closed, p(x, F) > 0 for all x G X\Y (Exercise 
(3.1.10:3)), and so 



F{x) 



f{x) \i X & Y 



p{x,Y) 



defines a function F : AT — >■ R that coincides with f on Y. To prove that F 
satisfies (ii) and (hi), we need only show that 1 < F(x) < 2 for all x G X\Y. 
For such X and all y G F we have 



F{x) < 



‘^P(x,y) 

p{x,Y) ■ 



So, given e > 0 and choosing y €Y such that 

p(x,y) < (l + I) p(x,Y), 
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we obtain F{x) <2 + e. On the other hand, choosing y' &Y such that 



, ^ P{x,y') ^ f{y')p{x,y') 
- p{x,Y) - p(x,Y) 



< F{x) + e, 



we see that F{x) > 1 — e. As e > 0 is arbitrary, it follows that 1 < F{x) < 2. 

Since / is continuous on Y° , so is F. Also, the function x !->• p{x,Y) 
is uniformly continuous on X\Y, by Exercise (3.2.11:4); so, by Exercises 
(3.2.1: 5 and 4), F is continuous on X\Y. It therefore remains to prove the 
continuity of F at any ^ G E fi X\Y. Given £ > 0, choose r > 0 such that 
if y GY and p{^,y) < r, then |/(^) — f{y)\ < e. It suffices to prove that if 
X G X\Y and p{x,^) < r/4, then 



F{0-£< F{x) < F{S)+e. 



( 1 ) 



To this end, observe that for each y G Y\B{^,r), 

p{x, y) > p{^, y) - p{x, 0 > ^ > 2p{x, 0> P{x,Yn B{^, r )) , 



SO 

f{y)p{x,y) > ^ > fi0p{x,0 > inf f (rj) p{x , rj) . 

^ V&YnB{^,r) 

It follows that 

p{x,Y)=p{x,YnB{^,r)) (2) 

and that 

inf /( 2 /)p(a:, 2 /) = inf f (y) p{x , y) . (3) 

yeY i/GVnB(c,r) 

For each y G Y D B{^,r) we have 

fiO-s<f{y) <fiO + s 



and therefore 



(/(C) - s) P{x, Y) < f{y)p{x, y) < (/(^) + e) p{x, y). 

Hence 

(/(C) -e)p(a;,T) < inf f(i/)p(a;, y) < (/(^) + £) p(a;, E Cl H($, r)), 

yGVnS(C,r) 

and so, by (2) and (3), 

(/(C) - e) P{x, Y) < inf f{y)p{x, y) < (/(C) + e) p{x, Y). 

yeY 

Dividing through by p{x,Y), we obtain the desired inequalities (1). □ 

The mapping F in Theorem (3.2.13) is called a continuous extension of 
/ to A. 
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(3.2.14) Exercises 

.1 Give two proofs of Urysohn’s Lemma: if S', T are nonempty disjoint 
closed subspaces of a metric space X, then there exists a continuous 
mapping / : X — >• [0, 1] such that /(S) = {0} and /(T) = {1} . (For 
one proof, note that p{x, S) + p{x, T) > 0 for all x £ X.) 

.2 Let F be a closed subspace of a metric space X, and / a continuous 
mapping of Y into R. Prove that there exists a continuous extension 
E : X — >■ R of /. (First apply Theorem (3.2.13) to g o f for some 
suitable function g.) 

.3 Suppose that for each pair S, T of nonempty disjoint closed subsets 
of X there exists a uniformly continuous mapping / : X — >■ [0, 1] such 
that f{S) = {0} and f{T) = {1} . Prove that X is complete. 

.4 Show that the following are equivalent conditions on X. 

(i) Every continuous function / : X -4 R is uniformly continuous. 

(ii) p{S, T) > 0 for all nonempty disjoint closed subsets S, T of X. 

(To prove that (ii) implies (i), suppose that / : X — >• R is continuous 
but not uniformly continuous. Then there exist sequences (xn), (pn) 
in X and a positive number a such that lim„_>oo p(a;„, j/„) = 0 and 
\f{xn) — f{yn)\ > a for all n. Consider the sets S = {x„ : n > 1} and 
T = {yn:n>l}.) 



3.3 Compactness 

In the context of a metric space, the various notions associated with the 
word compactness represent different generalisations of, and approxima- 
tions to, finiteness. 

Let S' be a subset of a metric space (X, p). By a cover of S we mean a 
family U of subsets of X such that S C IJ W; we then say that S is covered 
by U, and that U covers S. If also each U G W is an open subset of X, we 
refer toU as an open cover of S. On the other hand, if W is a finite set, we 
call it a finite cover of S. By a subcover of 14 we mean a subfamily T oill 
that covers S. 

A metric space X is called compact, or a compact space, if every open 
cover of X contains a finite subcover. By a compact set in a metric space 
X we mean a subset of X that is compact when considered as a metric 
subspace of X. 

Note that we can apply our definition of compactness to a topological 
space X, even if the topology of X is not metrisable. 
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The Heine-Borel-Lebesgue Theorem (1.4.6) shows that a bounded closed 
interval in R is compact. 

(3.3.1) Proposition. A compact subset of a metric space is separable 
and bounded. 

Proof. Let S' be a compact subset of a metric space X. We may assume 
that S is nonempty. For each positive integer n the family (B{s,n~^))seS 
of open balls is an open cover of S, so there exists a finite subset of S 
such that S is covered by the balls B{s,n~^) with s G Fn- It follows that 
the countable set U^i is dense in S, which is therefore separable. 

Now fix Si G fi, and define the nonnegative number 

R = max{p(s, si) : s G S’!}. 

For each x £ S choose s G Fi such that p{x, s) < 1; then 
p{x, si) < p{x, s) + p{s, si) < 1 + R. 

Hence S is bounded. □ 

(3.3.2) Proposition. A compact set in a metric space is closed. 

Proof. Let S' be a compact subset of a metric space X. We may assume 
that X\S is nonempty. If a G X\S, then for each s G S, 

0 < Ts = p{a, s). 

The open balls B{s, ^r^), with s G S, form an open cover of S, so there 
exists a finite subset F of S such that S is covered by the balls B{s, ^rg) 
with s £ F. Define the positive number 

r = min{rs : s G F}. 

For each x £ S choose s £ F such that x G B{s, |rg); then 
p(a,x) > p{a,s) - p{x,s) 

>rs- ^rg 

>lr. 

It follows that B{a, |r) C X\S and therefore that a is an interior point 
of X\S. Since a is any point of X\S, we conclude that X\S is open and 
therefore that S is closed. □ 



(3.3.3) Proposition. A compact metric space is complete. 
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Proof. Let AT be a compact metric space, and X its completion (Exercise 
(3.2.10:8)). By Proposition (3.3.2), AT is a closed subspace of X. Since X 
is complete, it follows from Proposition (3.2.9) that X is complete. □ 

(3.3.4) Proposition. A dosed subset of a compact metric space is 
compact. 

Proof. Let S' be a closed subset of a compact metric space X, and let U 
be an open cover of S. By Proposition (3.1.5), for each U G 14 there exists 
an open set Vu in X such that U = S C\Vu- Then X\S and the sets Vu, 
with U G U, form an open cover of X. Since X is compact, there exist 
finitely many sets Ui,. . . ,Un in U such that 

{X\S} U {P[/i, . . . , V(/„} 

is an open cover of X. Clearly, {[/i, . . . , [/„} covers S and so is a finite 
subcover ofU; whence S is compact. □ 

(3.3.5) Exercises 

.1 Find an alternative proof of Proposition (3.3.2). 

.2 Find an alternative proof of Proposition (3.3.3). (Suppose that X is 
compact but not complete, and let (a;„) be a Cauchy sequence in X 
that does not converge and therefore has no convergent subsequence. 
Then for each x G X there exist > 0 and G N”*" such that 
p{xn,x) > Xx for all n > Nx- Cover X by finitely many of the balls 
B{x, \rx).) 

.3 Prove that a subset of the Euclidean space R" is compact if and only 
if it is bounded and closed. 

.4 A family T of subsets of a set X is said to have the finite intersection 
property if every finite subfamily of T has a nonempty intersection. 
Prove that a metric space X is compact if and only if every family 
of closed subsets of X with the finite intersection property has a 
nonempty intersection. 

.5 Let AT be a compact subset of an open set U C X. Prove that there 
exists r > 0 such that if p{x, K) < r, then x G U. 

.6 Prove that any open cover of a separable metric space has a countable 
subcover. (This is a special case of Lindeldf’s Theorem; see page 72 
of [47].) 

(3.3.6) Proposition. If f is a continuous mapping of a compact metric 
space X into a metric space Y , then f{X) is a compact set. 
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Proof. Let U be an open cover of f{X). By Proposition (3.2.2), the family 
is an open cover of X . Since X is compact, there is a finite 
set T (ZU such that (/“^(C/))^^^ is an open cover of X. Then T is an 
open cover of f{X), which is therefore compact. □ 



(3.3.7) Exercises 

.1 Prove that a continuous mapping / of a compact metric space X into 
R is bounded. Prove also that / attains its hounds , in the sense that 
there exist points a, 6 in X such that /(a) = inf / and f{b) = sup/. 

.2 Prove that a continuous mapping of a compact space X into has 
a positive infimum. 

.3 Prove that if / is a continuous one-one mapping of a compact metric 
space X onto a metric space Y , then the inverse mapping f~^ : Y — >■ 
X is continuous. (Use Proposition (3.2.2).) 

.4 A mapping / of a set X into itself is called a self-map of A. By 
a fixed point of such a mapping we mean a point x € X such that 
f{x) = X. Let / be a contractive self-map of a compact metric space 
X (see Exercise (3.2.1: 3)). Prove that the mapping x i-l p{x,f{x)) of 
X into R is continuous. Applying Exercise (3.3.7: 1) to this mapping, 
deduce that / has a fixed point {Edelstein’s Theorem). Prove that 
there is no other fixed point of /. 



There are other properties of a metric space X that capture the idea of 
approximate finiteness and are intimately related to compactness. We say 
that X is 

• sequentially compact if every sequence in X has a convergent 
subsequence; 

• totally bounded, or precompact, if for each e > 0 there exists a finite 
cover of X by subsets of diameter < e. 

Sequential compactness, like compactness, is a topological concept, whereas 
total boundedness is a metric notion. An analogue of sequential compact- 
ness can be defined for a general topological space; see under “filters” in [7] 
or [47]. For a nonmetric analogue of total boundedness we need the context 
of a uniform space, which is also discussed in [7] and [47]. 

The total boundedness of a metric space X can be expressed differently. 
By an e-approximation to X we mean a subset S' of A such that p{x, S) < s 
for each a; G A. It is easy to show that A is totally bounded if and only 
if for each £ > 0 it contains a finite e-approximation. Note that since the 
empty set is regarded as finite, it is also totally bounded. 
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Corollary (1.2.8) shows that a bounded closed subset of R is sequentially 
compact. 



(3.3.8) Exercises 

.1 Prove that a bounded interval in R is totally bounded. 

.2 Prove that a subset of a totally bounded metric space is totally 
bounded. 

.3 Prove that if a metric space is either sequentially compact or totally 
bounded, then it is bounded. 

.4 Show that a totally bounded metric space is separable. 

.5 Let / be a uniformly continuous mapping of a totally bounded metric 
space into a metric space. Prove that the range of / is totally bounded. 

.6 Let X be a metric space that is not totally bounded. Prove that 
there exist a sequence (x„) in X and a positive number a such that 
p{xm, Xn) > cx. whenever m ^ n. 

.7 Let / be a function of bounded variation on a compact interval / c R. 
Prove that /(/) is totally bounded. (Use the preceding exercise.) 

.8 Let AT be a metric space that is not totally bounded, and choose 
(xn) and a as in Exercise (3.3.8: 6). For each n construct a uniformly 
continuous function </>„ : — >■ [0, 1] such that (i) 4>n{xn) = 1 and (ii) 

4>n{x) = 0 if p{x, Xn) > a/3. Given any sequence (c„) of real numbers, 
show that / = Cn4>n is a well-defined continuous function on 

X, and that if (c„) is bounded, then / is uniformly continuous on X. 

.9 Let (X,p) be a separable metric space. Show that there exists on X 
a metric d equivalent to p, such that {X, d) is totally bounded. (Let 
(Xn) be a dense sequence in X, and use Exercise (3.2.1: 6) to reduce 
to the case where p < 1. Define 

OO 

d{x,y) = ^ 2"” \p{x,Xn) - p{y,Xn)\ 

n—1 



for all x,y £ X.) 

We now arrive at a fundamental theorem linking compactness, sequential 
compactness, and total boundedness. 

(3.3.9) Theorem. The following are equivalent conditions on a metric 
space {X, p) . 
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(i) X is compact. 

(ii) X is sequentially compact. 

(iii) X is totally hounded and complete. 

Proof. First, let X be a compact metric space, and (a;„) a sequence in 
X. For each n let be the closure of {a;„, x„+ 2 , . . .} in X. It is 

easy to show that (Fn)^! has the finite intersection property. By Exercise 
(3.3.5: 4), n^i contains a point a. Consider any neighbourhood U of a. 
For each n, since a £ Fn, there exists m> n such that Xm G U. It follows 
that U contains Xk for infinitely many values of k; whence, by Proposition 
(3.2.7), there exists a subsequence of (x„) converging to a. Thus (i) implies 
(ii). 

Next, let X satisfy (ii). Then any Cauchy sequence in X has a conver- 
gent subsequence and so converges to a limit in X, by Exercise (3.2.10: 3); 
whence X is complete. Suppose that X is not totally bounded. Then, by 
Exercise (3.3.8: 6), there exist a sequence (a;„) in X and a positive number 
a such that p{xm,Xn) > a whenever m ^ n. Clearly, (x„) has no Cauchy 
subsequences and therefore no convergent subsequences. This contradicts 
our assumption (ii); so, in fact, X is totally bounded. Thus (ii) implies (iii). 

It remains to prove that (iii) implies (i). Accordingly, let X be totally 
bounded and complete, and suppose that there exists an open cover U of 
X that contains no finite subcover. With Bq = X, we construct a sequence 
of closed balls in X such that for each n > 1, 

(a) i?„ has radius 2“", 

(b) Bn has a nonempty intersection with Bn-i, and 

(c) no finite subfamily of W is a cover of B„_i. 

Having constructed Hq, . . . , Bn-i with the applicable properties, let 
be a finite cover of Bn-i by balls in Bn-i of radius 2“”. (Note that Bn-i 
is totally bounded, by Exercise (3.3.8: 2).) Amongst the sets Vj there exists 
at least one — call it — that is not covered by finitely many of the sets in 

U'. otherwise each of the finitely many sets Vj, and therefore Bn-i, would 
be covered by finitely many elements of U, thereby contradicting (c). This 
completes the inductive construction of 

For each n > 1 let be the centre of Since nH„_i is nonempty, 
it follows from the triangle inequality that for n >2, 

p(x„,a:„_i) <2-" + 2-"+i <2-"+2. 

So if j > i > > Ij then 

3 

p{x^,Xj)< ^ p{xk,Xk-l) 
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oo 

< 2 “*+^ ^ 2 “'" = 2 “*+^ < 2 “^+^. 
k=0 

Hence (a:„) is a Cauchy sequence in X and so, as X is complete, converges 
to a limit x^o in X. Now pick U € U such that x^o € U . Since U is open, 
there exists r > 0 such that B{xao,r) C U. Choosing N > 1 such that 
p{xoo,xn) < f/2 and 2~^ < rj2, we see that for each x G B^q, 

p{x, Xoo) < p{x, xn) + p(xoo,xn) < 2“^ + r/2 < r, 

so X £ B{xoo,r)- Hence Bn C B{xoo,r) C U, which contradicts (c). It 
follows that our initial assumption about the open cover U is false; whence 
X is compact, and therefore (iii) implies (i). □ 

The proof that (iii) implies (i) in Theorem (3.3.9) is a generalisation of 
the argument we used to prove the Heine-Borel-Lebesgue Theorem (1.4.6). 

(3.3.10) Exercises 

.1 Use sequential compactness arguments to show that a compact subset 
of a metric space is both bounded and closed. 

.2 Show that if X is compact, then there exist points a,b of X such that 
p{a,b) = diam(Jf). 

.3 Let A, B be nonempty disjoint subsets of a metric space X with A 
closed and B compact. Give two proofs that p{A, B) > 0. 

.4 Let (S'n) be a descending sequence of compact sets in a metric space 
X (so 5i D 52 U • • •). Prove, in at least two different ways, that if 
5„ yf 0 for all n, then H^i ^ 0- 

.5 Let X be a compact space in which each point x is isolated (see 
Exercise (3.1.8: 5)). Give at least two proofs that X is finite. 

.6 Prove that if every continuous mapping of X into R is bounded, then 
X is compact. (First suppose that X is not totally bounded, and use 
Exercise (3.3.8: 8) to construct an unbounded continuous mapping of 
X into R. Then use Exercise (3.2.11:9).) 

Is this true if “continuous” is replaced by “uniformly continuous” in 
the hypothesis? 

.7 Prove that if every uniformly continuous mapping of X into R^ has 
a positive infimum, then X is compact, (cf. Exercise (3.3.7: 2). Use 
Exercises (3.3.8: 8), (3.2.11:5), and (3.2.11:8).) 
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.8 Let {X, p) be a metric space, and suppose that X is complete with 
respect to every metric equivalent to p (see Exercise (3.1.3: 6)). Prove 
that X is compact. (Suppose that X is not totally bounded. By Exer- 
cise (3.3.8: 6), there exist a sequence (x„) in X and a positive number 
a such that p{xm, a:„) > a whenever m ^ n. Show that 

d{x,y) = min \p{x,y), inf [p{x,x„,) + \^ - ^\a + p{y,Xn)}\ 

I m,n>l \ UL !L \ j I 

defines a metric equivalent to p with respect to which X is not 
complete.) 

.9 Prove that the following are equivalent conditions on a metric space 
{X,p). 

(i) If d is a metric equivalent to p, and S', T are disjoint closed 
subsets of (X,d), then d{S,T) > 0. 

(ii) X is compact. 

(Use Exercises (3.3.10:3), (3.2.10:6), and (3.3.10:8); also, note the 
guide to the solution of Exercise (3.3.5: 2).) 

The following property of a metric space X is known as the Lehesgue 
covering property. 

For each open cover U oi X there exists r > 0 such that any 
open ball of radius r in X is contained in some U G U. 

The positive number r associated with the open cover U in this way is 
called a Lehesgue number for U. 

(3.3.11) Proposition. A compact metric space has the Lehesgue cover- 
ing property. 

Proof. Let X be a compact metric space, and 14 an open cover of X. For 
each X G X choose r^ > 0 such that B{x,2rx) C U for some U G 14. The 
balls B(x,rx), with x G X, form an open cover of X, from which we can 
extract a finite subcover, say 

{B{xi,rxi) : I <i <n} . 

Then 

0 < r = min{rxi,. ■ ■ XxA ■ 

Given x G X, choose i such that x G B{xi,rxi). Then for each y G B{x,r) 
we have 

p{y,Xi) < p{x,y) p{x,x^) <r-\-rx, < 2rx,. 
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So 



B{x,r) C B{xi,2r^.) C U 



for some U €U. □ 



The implications (i) (ii) (iii) of the first part of the next result — a 
general version of the Uniform Continuity Theorem for metric spaces — are 
well known, in contrast to the implication (iii) (i), which is due to Wong 

[55]. 

(3.3.12) Theorem. The following are equivalent conditions on a metric 
space X. 

(i) X has the Lehesgue covering property. 

(ii) Every continuous mapping of X into a metric space is uniformly 
continuous. 

(iii) Every continuous mapping of X into R is uniformly continuous. 



Proof. Assuming (i), let / be a continuous mapping of X into a metric 
space, and let £ > 0. For each t £ X there exists > 0 such that if 
p{x,t) < 5t, then p{f{x),f{t)) < e/2. It follows from the triangle inequality 
that if X and y belong to B{t,6t), then p{f{x),f{y)) < e. Let 5 > 0 be a 
Lebesgue number for the open cover (B{t,St))^^x ^ V 

points of X such that p{x,y) < 6, then both x and y belong to B{x,S), 
which is a subset of B{t,St) for some t; so p{f{x),f{y)) < e. Thus / is 
uniformly continuous, and therefore (i) implies (ii). 

It is trivial that (ii) implies (iii). To complete the proof, suppose that X 
does not have the Lebesgue covering property; so there exists an open cover 
lA of X for which there is no Lebesgue number. For each positive integer 
n we can therefore construct Xn £ X such that B{xn,n~^)\U is nonempty 
for each U £U. Then there exists G B(a;„, n“^)\ {x„} : for otherwise 
we would have 

B(^Xn^ n ) — £- U 

for some U £14. We show that 

neither (a;„) nor (?/„) has a convergent subsequence. (1) 

Indeed, if (Xn) had a subsequence that converged to a limit f £ X, then, 
choosing U £ U such that f £ U, we would have B(xn,n~^) C U for 
some n, a contradiction. On the other hand, if {ynk)’k=i were a convergent 
subsequence of (j/n), then the subsequence {xn^} of (a:„) would converge to 
the same limit, which contradicts what we have just proved. 
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Setting Til = 1, suppose we have constructed rii < U 2 < ■ ■ ■ < rik such 
that the sets 



Sk — j • ■ • j } 5 

Tk = {j/ni ) ■ ■ • ) Vrik } 

are disjoint. There exists Uk+i > Uk such that ^ Sk and yuk+i ^ Tk'. 

otherwise we would have either xj G Sk for infinitely many j or else yj G Tk 
for infinitely many j; since Sk and Tk are finite, this would imply that either 
{xn) or (y„) had a convergent subsequence, thereby contradicting (1). Thus 
we have inductively constructed a strictly increasing sequence of 

positive integers such that the sets 

S = {Xn^ : fc > 1} , 

T = {yuk : fc > 1} 

are disjoint. These sets are both closed in X-. for example, any point of 
S\S would be the limit of some subsequence of (a:„), which contradicts 
(1). Applying Urysohn’s Lemma (Exercise (3.2.14: 1)), we now construct a 
continuous function / : A — >• [0, 1] such that f{S) = {0} and f{T) = {1} . 
Since p{x„,^,yn^) < 1/uk but |/(a:„J - /(ynj,)! = 1, the function / is not 
uniformly continuous. Hence (iii) implies (i). □ 

(3.3.13) Corollary — The Uniform Continuity Theorem. Every 
continuous mapping of a compact metric space into a metric space is 
uniformly continuous. 

Proof. This follows from the preceding two results. □ 

The converse of Corollary (3.3.13) is not true, since every function from 
the discrete metric space N to R is uniformly continuous but N, being un- 
bounded, is not compact. However, there is an interesting partial converse 
to Corollary (3.3.13), which we discuss in Section 4. 

(3.3.14) Exercises 

.1 Use a sequential compactness argument to prove that a compact 
metric space has the Lebesgue covering property. 

.2 Give an example of a totally bounded metric space for which the 
Lebesgue covering property does not hold. 

.3 Prove that X has the Lebesgue covering property if and only if for 
each nonempty closed set S' C A and each open set U containing S, 
there exists r > 0 such that the r -enlargement of S, 



B{S, r) = {x G X : p{x, S) < r} , 
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is contained in U. (For “only if”, consider the open cover {AT\S', U} 
of X. For “if”, suppose that X does not have the Lebesgue cov- 
ering property and, as in the second part of the proof of Theo- 
rem (3.3.12), construct disjoint nonempty closed subsets S,T of X 
such that p{S,T) = 0; then show that there exists r > 0 such that 
B{S,r)cX\T.) 

.4 Prove that a metric space with the Lebesgue covering property is 
complete. Need it be totally bounded? 

.5 Let X have the Lebesgue covering property, and let F be a closed 
subset of X. Give two proofs that Y has the Lebesgue covering prop- 
erty. (For one proof, use the Tietze Extension Theorem; for another, 
work directly with an open cover of Y.) 

.6 Prove the Uniform Continuity Theorem using sequential compactness 
without the Lebesgue covering property. 

.7 Let A1 be a metric space, and h a mapping of X into a compact 
metric space Y. Suppose that / o ft, is uniformly continuous for each 
continuous (and therefore uniformly continuous) mapping / : F — ^ R. 
Give at least two proofs that ft is uniformly continuous. 



The notion of compactness can be generalised in a number of ways. The 
one we deal with is typical of topology, in that it replaces a global property 
(one that holds for the whole space) by a local one (one that holds in some 
neighbourhood of any given point). 

A metric space X is said to be locally compact, or a locally compact 
space, if each point in X has a compact neighbourhood in X. For example, 
although (in view of Proposition (3.3.1)) R is not compact, it is locally 
compact: if x G R, then [x — 1, x -I- 1] is a compact neighbourhood of x in 
R. Of course, a compact metric space is locally compact. 

(3.3.15) Proposition. Let X he a locally compact space, and S a subset 
of X. If either S is open or S is closed, then S is locally compact. 

Proof. Let a G S, and choose a compact neighbourhood K of a in AT. If 
S is open, then 

aG {KnS)° = K° ns°, 

so there exists r > 0 such that B{a,r) C K and B{a,r) C S. As B{a,r) is 
closed in X, it is closed in K (by Proposition (3.1.5)) and therefore compact 
(by Proposition (3.3.4)). Hence a has a compact neighbourhood in S, and 
so S is locally compact. 

Now suppose that S is closed in X. Since iF is a neighbourhood of a in 
X, AT n S' is a neighbourhood of a in S (Exercise (3.1.6: 4)). Also, K (1 S is 
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closed in K, by Proposition (3.1.5), and therefore compact, by Proposition 
(3.3.4). Hence S is locally compact. □ 



(3.3.16) Exercises 

.1 Let S and T be locally compact subspaces of a locally compact metric 
space X. Prove that S' fl T is locally compact. Need S U T be locally 
compact? 

.2 Is every locally compact space complete? 

.3 Let X be a metric space in which every bounded set is contained in 
a compact set. Prove that X is locally compact and separable. 

.4 Let X be locally compact, and K a compact subset of X. Prove that 
for some r > 0 the closure of the r-enlargement of K is compact. 
(See Exercise (3.3.14:3).) 

.5 Let X be a separable locally compact metric space. Show that there 
exists a sequence (P„) of open subsets of X, each of which has compact 
closure, with the property that for each x G X and each neighbour- 
hood U of X there exists n such that x G Vn C U. Hence prove that 
there exists a sequence ([/„) of open subsets of X with the following 
properties. 

(i) U„ is compact; 

(h) Un G1 

(hi) X = U“l Un- 

(Set Ui = Vi and Un+i = Vn+i U H([/„,r), where, using Exercise 
(3.3.16: 4), r > 0 is chosen to make the closure of B{Un, r) compact.) 

.6 Let X be a separable locally compact metric space that is not com- 
pact, and let (C/„) be as in the preceding exercise. Use Urysohn’s 
Lemma (Exercise (3.2.14: 1)) to show that there exists a continuous 
function / : X — >• R such that f{x) < n for all x G Un, and f{x) > n 
for all X G X\Un- Then show that 

d{x,y) = p{x,y) + \f{x) - f{y)\ 

defines a metric d equivalent to p, and that in the space {X, d) any 
bounded set is contained in a compact set. 
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3.4 Connectedness 

In analysis there are many situations where progress is made by restrict- 
ing attention to parts of a metric space that cannot be split into smaller, 
separated parts. Our next definition captures this imprecise idea formally. 

A metric space is said to be connected, or a connected space, if it can 
not be expressed as a union of two disjoint nonempty open subsets. So 
if X is connected, and if S, T are nonempty open subsets of X such that 
S' U T = X, then S fl T yf 0. A subspace that is connected is called a 
connected set in the metric space. Clearly, the empty subset of any metric 
space is connected. 

(3.4.1) Proposition. The following are equivalent conditions on a 
metric space X. 

(i) X is connected. 

(ii) X is not a union of two disjoint nonempty closed subsets. 

(iv) The only subsets of X that are both open and closed in X are X and 
the empty subset. 

Proof. The straightforward proof is left as the next exercise. □ 

(3.4.2) Exercises 

.1 Prove Proposition (3.4.1). 

.2 Prove that a metric space X is connected if and only if there is no 
continuous mapping of X onto {0,1}. 

We showed in Proposition (1.3.13) that the only subsets of R that are 
both open and closed are R and 0. It follows from Proposition (3.4.1) that 
R is connected. In fact, we can say more. 

(3.4.3) Proposition. A nonempty subset of R is connected if and only 
if it is an interval. 

Proof. Let S' be a nonempty subset of R, and suppose first that S is 
connected. Let a, b be points of S with a < b, and consider any x such that 
a<x<b.lfx^S, then S is the union of the disjoint subsets S fl (— oo, x) 
and S n (x,oo), each of which is open in S, by Proposition (3.1.5). This 
contradicts the assumption that S is connected. So a: G S, and therefore S 
has the intermediate value property. Hence, by Proposition (1.3.3), S is an 
interval. 

Now let S be an interval in R, and suppose that S is not connected. 
Then there exist nonempty open subsets A, B of the subspace S such that 
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S = AVJ B and AC\ B = 0. We may assume that there exist a G A and 
b G B such that a < b. Let x be the supremum of the nonempty bounded 
set A n [a, b), and suppose that x G A. Then a<x<b, asb^ A. Since A 
is open in S, there exists r > 0 such that S' fl [x, x + r] d A C\ [a, b). Being 
an interval, S has the intermediate value property (Proposition (1.3.3)), so 
[a, &] C S, and therefore [x,x + r] C S. Hence x + r G Ar\[a,b), which 
contradicts the definition of x. Thus, in fact, x (fi A. K similar argument 
shows that x (ji B , which is absurd since, as we have already observed, 
[a, b] C S. This contradiction shows that S is connected. □ 

(3.4.4) Exercise 

Let S, T be nonempty closed subsets of a metric space X such that 
S U T and S fl T are connected. Prove that S and T are connected. 
Give an example to show that the conclusion no longer holds if we 
remove the hypothesis that S and T are closed. 

We now prove some general results about connected spaces. 

(3.4.5) Proposition. If S,T are subsets of a metric space X such that 
S is connected and S C T C S, then T is connected. In particular, S is 
connected. 

Proof. Suppose that A, B are nonempty open sets in the subspace T such 
that T = Ad B and H fl H = 0. As S' is dense in T, both S fl A and S C\B 
are nonempty. They are clearly disjoint, and, by Proposition (3.1.5), they 
are open in S. Since S = (S fl A) U (S fl H), we have contradicted the fact 
that S is connected. □ 

(3.4.6) Proposition. If if is a family of connected sets in a metric space 
X such that p| iF is nonempty, then IJ iF is connected. 

Proof. Let S = IJ .F and a G p| F. Suppose that S = AU B, where 
A, B are nonempty disjoint open sets in S. Consider, for example, the case 
where a G A. Choose F G T such that B D F is nonempty, and note that 
a G An F. Then A fl F and B C\ F are open in F (by Proposition (3.1.5)), 
have union F, are disjoint, and are nonempty. This contradicts the fact 
that F is connected. □ 

(3.4.7) Exercises 

.1 Let S,T be connected subsets of a metric space X such that S' fl T 
is nonempty. Prove that S U T is connected. 

.2 Let (S„) be a sequence of connected subsets of a metric space X such 
that S„nS„+i is nonempty for each n. Prove that IJ S„ is connected. 
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.3 A metric space X is said to be chain connected if for each pair a, b 
of points of X, and each £ > 0, there exist finitely many points 
a = Xq, Xi, . . . ,Xn = b such that p{xi, Xi+i) < e: for z = 0, . . . , n — 1. 
Prove that a compact, chain connected metric space is connected. 

.4 If X is a metric space, then it follows from Proposition (3.4.6) that 
for each x £ X, 

Ca; = (^ {S' C AT : S' is connected and a; G S} 

is connected. Cx is called the connected component of x in X. Prove 
the following statements. 

(i) Cx is closed in X. 

(ii) Cx is the largest connected subset of X that contains x. 

(hi) If y £ Cx, then Cy = Cx- 

(iv) If y ^ Cx, then CyC\Cx = 0- 

.5 A subset S of a metric space X is said to be totally disconnected if 
for each x £ X the connected component of a; in S is {a;}. Prove that 

(i) every countable subset of R is totally disconnected; 

(ii) the irrational numbers form a totally disconnected set in R. 

.6 A metric space X is said to be locally connected if for each x £ X and 
each neighbourhood U of X there exists a connected neighbourhood 
P of a; with V C U. Prove that X is locally connected if and only if 
the following property holds: for each open subset S of X, and each 
x £ S, the connected component of x in the subspace S is an open 
subset of X. 

.7 Use Proposition (3.4.3) and the previous exercise to give another 
proof of Proposition (1.3.6). 

.8 Let X be a connected space, and S a nonempty subset of X such that 
X\S is also nonempty. Show that the boundary of S is nonempty. 
(Suppose the contrary.) 

(3.4.8) Proposition. The range of a continuous mapping from a 
connected metric space into a metric space is connected. 

Proof. Let AT be a connected space, and / a continuous mapping of 
X into a metric space Y. Suppose that f{X) = S VJ T , where S,T are 
nonempty disjoint open sets in the subspace /(AT) of Y. By Proposition 
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(3.2.2), the nonempty disjoint sets / ^(S') and / ^(T) are open in X. 

X = /-I (/(X)) = f-\SUT) = f-\S) U f-\T), 
it follows that X is not connected, a contradiction. □ 

A very important consequence of Proposition (3.4.8) is the following 
generalised Intermediate Value Theorem. 

(3.4.9) Theorem. Let f be a continuous mapping of a connected metric 
space X into R, and a, b points of f{X) such that a < b. Then for each 
y G (a, b) there exists x G X such that f{x) = y. 

Proof. By Propositions (3.4.8) and (3.4.3), f{X) is an interval. The result 
follows immediately. □ 

(3.4.10) Exercises 

.1 Let X be an unbounded connected metric space. Prove that for each 
X G X and each r > 0 there exists y G X such that p{x, y) = r. 

.2 Let S' be a connected subset of the Euclidean space R". Prove that 
for each r > 0 the set 

{x G R” : p{x, S) < r} 

is also connected. 

.3 Let A be a compact metric space, and suppose that the closure of 
any open ball B{a,r) in X is the closed ball B{a,r). Show that any 
open or closed ball in X is connected. (Suppose that B{a,r) = SUT, 
where S, T are nonempty, disjoint, and closed in the subspace B{a, r). 
Without loss of generality take a in A. Show that 

C = {x G X\S : p{a,x) > p{a,T)} 

is compact, and hence that there exists to G T such that p{a,to) = 
p{a,T) > 0. Then consider B{a, p{a,T)).) 

Show by an example that we cannot remove the compactness of X 
from the hypotheses of this result. 

We now prove the partial converse to Corollary (3.3.13) that was 
postponed from Section 3. 

(3.4.11) Proposition. Let X be a connected metric space such that ev- 
ery continuous function from X to R is uniformly continuous. Then X is 
compact. 
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Proof. Suppose that X is not totally bounded. By Exercise (3.3.8: 6), 
there exist a sequence (xn) in X and a positive number a such that 
p{xm,Xn) > a whenever m ^ n. Using Exercise (3.3.8: 8), we can con- 
struct, for each k, a uniformly continuous function <f>k : X ^ [0, 1] such 
that 4>k{xk) = 1, 4>k{x) = 0 if p{x,Xk) > a/3, and / = is a 

well-defined continuous function on Jf; to be precise, we set 

4>k{x) = max {O, 1 — 3a“^p(a;, x^)} ■ 

Our hypotheses ensure that / is uniformly continuous. Now, X is con- 
nected, the mapping x i— p{x,Xn) is continuous on AT, p(x„,a;„) = 0, and 
p{xn+i,Xn) > a. It follows from Theorem (3.4.9) that there exists x G X 
such that p{x,Xn) = aj3n. Then 

f{x„) - f{x) = n - (n - 1) = 1. 



Since n > 1 is arbitrary, / is not uniformly continuous. This contradiction 
shows that X is totally bounded. 

Now suppose that X is not complete; so there exists a Cauchy sequence 
(xn) in X that does not converge to a limit in X. Without loss of generality 
we may assume that X is a dense subset of its completion (X,p). So (x„) 
converges to a limit x^o G X\X. The function x i— 1 p{x,Xao) is (uniformly) 
continuous and positive-valued on X, so 



fix) 



1 

pix,Xoo) 



defines a continuous mapping f : X -G R^. By our hypotheses, / is uni- 
formly continuous on X, so there exists 5 > 0 such that ii x,y G X and 
p{x,y) < S, then \ f{x) — f{y)\ < 1. Choose N such that p{xm,Xn) < <5 for 
all n> N. Since p(xn, Xoo) > 0, there exist positive integers k, m such that 
m > N and 

Then p{xm,XN) < S but 



f{xm) - /(xn) > {k + 1) - k = 1, 



contrary to our choice of <5. Hence, in fact, X is complete and therefore, by 
Theorem (3.3.9), compact. □ 



There is another type of connectedness of importance in analysis and 
topology, one that generalises the informal idea that a subset X of the 
Euclidean plane is in one piece if any two points of X can be joined by 
a path that lies wholly in X. (In spite of this correct claim about the 
importance of this type of connectedness, we do not actually use it later in 
the book; so you can ignore the rest of this section with impunity.) 
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Let X be a metric space. A continuous mapping / : [0, 1] — 1 A such that 
/(O) = a and /(I) = b is called a path in X with endpoints a and b, or a 
path in X from a to b; the path / is also said to join a to b. We say that X 
is path connected, or a path connected space, if for each pair a, b of points 
of X there is a path in X from a to b. By a path connected subset of X we 
mean a subset of X that is path connected as a subspace of X. 

A subset S of R" is said to be convex if tx + {1 — t)y £ S whenever 
x,y £ S and 0 < t < 1. A convex subset S of R" is path connected: for if 
a,b £ S, then 

f{t) = (1 — t)a + tb (0 < t < 1) 

defines a path in S from a to b. In particular, an interval in R is path 
connected. 

(3.4.12) Proposition. A path connected space is connected. 

Proof. Let A be a path connected space; we may assume that A is 
nonempty. Let a £ X, and for each x £ X let fx be a path in A joining a to 
X] for convenience, let I = [0, 1]. Then fx{I) is connected, by Propositions 
(3.4.3) and (3.4.8), and a G fx{I)- Hence, by Proposition (3.4.6), A = 
[jxex fM) is connected. □ 

Propositions (3.4.3) and (3.4.12) show that path connectedness and con- 
nectedness are equivalent properties of a nonempty subset S of R, and hold 
precisely when S is an interval. In R^, however, there are subsets that are 
connected but not path connected; see Exercise (3.4.16: 1). Our next result 
is therefore substantial. 

(3.4.13) Proposition. A connected open subset o/R" is path connected. 

In order to prove Proposition (3.4.13) we need some simple consequences 
of the following Glueing Lemma. 

(3.4.14) Lemma. Let A, Y be metric spaces, and let A, B be closed sub- 
sets of X whose union is X. Let f : A and g : B ^ Y be continuous 
functions such that f{x) = g{x) for all x £ A D B. Then the function 
h : X ^ Y defined by 

( f{x) a X £ A 

h{x) = 

[ g{x) ii X £ B 

is continuous. 

Proof. Let C be a closed subset of Y . Then, by Proposition (3.2.2), 
f~^{C) is closed in the subspace A of A, and hence, by Exercise (3.1.6: 3), 
in A. Similarly, g~^{C) is closed in A. Hence 

h-\C) = f-\C)Ug-\C) 
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is closed in X. It follows from Proposition (3.2.2) that h is continuous. □ 

Now consider two paths /, g in a metric space X such that /(I) = g(0). 
We define the product of the paths / and g to be the path gf, where 

r f{ 2 t) if 0 < t < i 

9f{t) = S 

[ g{2t — 1) if ^ t < 1. 

It follows from Proposition (3.4.14) that gf is a path in X joining /(O) to 
5(1)- 

The product gf of two paths must not be confused with the composite 
go f of two mappings. Indeed, unless / is a path in [0, 1], the composite of 
the paths / and g is undefined. 

(3.4.15) Exercise 

We define the path component of a point x in a metric space X to be 

Px = {y & X : there exists a path in X from x to y}. 

Prove that Px is the union of the path connected subsets of X that 
contain x, and that it is the largest path connected subset of X 
containing x. Prove also that if x,y G X, then either Px = Py or 
PxDPy = 0 . 

Proof of Proposition (3.4.13). Let C/ be a connected open subset of 
R". For each x in U let Ux be the path component of x in [/; we first show 
that Ux is open in U. Given j/ in Ux, choose a path f in U joining x to y; 
choose also r > 0 such that B{y,r) C U. Since B{y,r) is convex, for each 
2 G B{y, r) there exists a path g in B{y, r) joining y to z; then gf is a path 
in U joining x to z. Hence B{y,r) C Ux, ,and therefore Ux is open in U. 

Now suppose that U is not path connected. Then there exist distinct 
points of U that cannot be joined by a path in U . Let a be one of these 
points. By the foregoing, Ua is nonempty and open in U , as is 

P = |J{C/, : X G U\Ua}. 

Moreover, U = UaU V. Since U is connected, Ua fi V is nonempty, so 
there exists b G U\Ua such that UafiUb ^ 0. Exercise (3.4.15) shows that 
Ua = Ub] whence b G Ua, a contradiction. Thus U is path connected. □ 

(3.4.16) Exercises 



= {{0,y)GR^:-l<y<l}, 

= {(a;,2/) G : 0 < X < 1, y = sin ^} , 



.1 Let 



A 

B 
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and X = AVJB. Prove that any connected subset of X that intersects 
both A and B has diameter greater than 2 . Then prove that X is not 
path connected. (Suppose there exists a path / : [ 0 , 1 ] — 1 X with 
/(O) G A and /(I) G B. Let 

T = sup {t G [ 0 , 1 ] : / ([ 0 , t]) C A} , 

and show that there exists t' > t such that /(r') G B and the 
diameter of /([r, r']) is less than 1 .) 

.2 Let iF be a family of path connected subsets of a metric space X such 
that Pi .F yf 0 . Prove that IJ F is path connected. 

.3 Let be a sequence of path connected subsets of a metric space 

X such that for each n > 1 , 



n— 1 

n U 0. 

Prove that U^i is path connected. 

3.5 Product Metric Spaces 

Let {Xi,pi) and {X^, P2) be nonempty^ metric spaces, and X their Carte- 
sian product Xi X X2- Throughout this section we use such notations as 
X = {xi,X2), x' = (x'i,X2), and a = (01,02) for points of X; we write 
Bk{akX) (respectively, Bk{ak,r)) for the open (respectively, closed) ball 
in Xk with centre Ofc and radius r. 

It is a simple exercise to show that the mapping p : X x F — >■ R defined 
by 

p{x,y) = max{pi(xi,?/i),p2(a;2,2/2)} 

is a metric — called the product metric — on X; taken with this metric, X 
is called the product of the metric spaces Xi and X2. We assume that X 
carries this metric in the remainder of this section. 

There are at least two other natural metrics on the set X: namely, the 
metrics p' and p” defined by 

p'{x,y) = \/pi{xi,yiy + p2{x2,y2)'^ 



and 

p"{x,y) = pi{xi,yi) + P2{x2,y2)- 



^The requirement that Xi and X 2 be nonempty enables us to avoid some 
minor complications. 
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Since 

p{x,y) < p'{x,y) < p"{x,y) < 2 p{x,y), 

the identity mapping ix (see Exercise (3.2.1:!)) is uniformly continuous 
when its domain and range are given any of the metrics p,p',p". Hence, 
in particular, each of these three metrics gives rise to the same topology 
(family of open sets) on X ; that is, the metrics are equivalent (see Exercise 
(3.1.3:6)). 

(3.5.1) Lemma. The open ball with centre a and radius r in the product 
space X is Bi{ai,r) x B2{a2, r), and the closed ball with centre a and radius 
r in X is Bi{ai,r) x i?2(a2,r). 



Proof. For example, we have 

p{a,x)<r max{pi(ai,a;i),p2(a2,a;2)} < r 

pi(ai,a;i) < r and p2{a2,X2) < r, 

so B{a,r) = Bi{ai,r) X B2{a2,r). □ 

(3.5.2) Proposition. If A\ is open in X\, and A2 is open in X2, then 
Ai X A2 is open in X. 

Proof. Let 

a A = Ai X H2 . 

Then ai G Ai and 02 G A2] so there exist ri,r2 > 0 such that Hi(ai,ri) C 
Ai and 52(02, r2) C A2. Let r = min{ri,r2}; then by Lemma (3.5.1), 

B{a,r) C 5i(oi,ri) x 52(o2,r2) C A. 

Hence a G A°, and so A is open in A". □ 

(3.5.3) Corollary. IfUk is a neighbourhood of Xk in Xk, then Ui x U2 
is a neighbourhood of x in X. 

Proof. Choose an open set Ak in Xk such that Xk G Ak C Uk- Then 



{xi,X2) G Ai X A2 C Ui X U2, 

where, by the previous proposition, A\ x A2 is an open subset of X. □ 
The mapping pr^, : X -G Xk defined by 



pr^,(a;i,a;2) = Xk 



is called the projection of X onto Xk- 
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(3.5.4) Proposition. If A is an open set in X, then pr^,(A) is open in 
the space Xk- 

Proof. Consider any xi € Xi. Either 

A{xi) = {x2 G ^2 : (a;i,a;2) G A} 

is empty and therefore open, or else there exists X2 G A{x\). In the latter 
case, since A is open, we can choose r > 0 such that B{x,r) C A, where 
X = (xi,X2)- If x'2 G X2 and P2{x2,x2) < r, then 

p{x,{xi,X2)) = P2 {x 2,X2) < r, 

so {xi,x'2) G A. Hence A{xi) is open in this case also. Since 

Pr2(^) = U A{xi), 

aJiGXi 

a union of open sets, it follows that pr2(H) is open in X2- A similar 
argument shows that pr]^(A) is open in Ai. □ 

Note that the projections of a closed subset of X need not be closed; see 
the remarks following the proof of Proposition ( 3 . 2 . 2 ) on page 137 . 

(3.5.5) Proposition. If A\ C X\ and A2 C X2, then 

A\ X A2 = Ai X A2. 



Proof. Let a G Ai x A2. Then for each £ > 0 there exist Xi G Ai and 
X2 G A2 such that pi(ai,a:i) < £ and ^2(^2) 2^2) < £; whence p{a,x) < £, 
where 

X = (xi,X2) G Ai X A 2 . 

Thus Ai X A2 C Ai X A2. 

On the other hand, if a ^ Ai x A2, then either ai ^ Ai or 02 ^ A2. 
Taking, for example, the first alternative, we see from Exercise ( 3 . 1 . 3 : 3 ) 
and Corollary ( 3 . 5 . 3 ) that the set (Ai\Ai) x A2, which is clearly disjoint 
from Ai X A2, is a neighbourhood of a; thus a ^ A\ x A2. Hence 

X\ (Ai X A2) C A\Ai X A2, 

so Ai X A2 C Ai X A2, and therefore Ai x A2 = Ai x A2. □ 

(3.5.6) Corollary. A\ X A 2 %s closed %n X if and only if Aj^ %s closed 
in Xk for each k. 



Proof. This follows immediately from the last proposition. □ 




168 



3. Analysis in Metric Spaces 



A mapping / from a set E into X = X\ x X 2 can be identified with the 
ordered pair (pr^ o /, pr2 o /); where there is no risk of confusion, we write 
/fe for the mapping pr^, o f oi E into Xf^, so that / = (/i, /2)- 

(3.5.7) Proposition. Let f be a mapping of a metric space (E,d) into 
X. Then f is continuous at a G E if and only if both f\ and /2 are 
continuous at a. 

Proof. Suppose that for each k, fk is continuous at a^. Given e > 0 , 
choose Sk > 0 such that if d{a,x) < Sk, then Pk{fk{o), fk{x)) < s. If 
d{a, x) < min { 5 i, ^2} > then 

P{f{a),f{x)) = max{pi(/i(a),/i(a;)), P2(/2(a), /2(x))} < e. 

Thus / is continuous at a. 

To prove the converse, first note that, trivially, pr^, is continuous on X; 
so if / is continuous at a, then so is pr^, o /, by Proposition ( 3 . 2 . 3 ). □ 

(3.5.8) Proposition. Let f be a mapping of a metric space E into X . 
Then f is uniformly continuous if and only if both f\ and /2 are uniformly 
continuous. 

Proof. This is left as an exercise. □ 

(3.5.9) Exercises 

.1 Prove that if a mapping f oi X into a metric space Y is continuous 
at (a, 6), then the mappings Xi 1— f{xi,b) and X 2 >— >■ f{a,X 2 ) are 
continuous at a and b, respectively. 

.2 Prove Proposition ( 3 . 5 . 8 ). 

.3 Let E be a metric space, A C E, and a G A\{a}. Prove that a 
mapping f : E ^ X has a limit at a with respect to A if and only if 
both 61 = Umt^a.teA fi(t) and &2 = f 2 (t) exist, in which 

case limt_>a_ teAf(t) = (61,62). 

.4 Prove that a sequence (Xn) in X converges to a limit in X if and 
only if both = lim„_>oo pri(a;„) and ^2 = lim„_>oo pr2(xn) exist, in 
which case lim„_>oo Xn = (6>6). 

.5 Prove that a sequence (x„) in X is a Cauchy sequence if and only 
if (pri(a;„)) is a Cauchy sequence in Xi and (pr2(a;„)) is a Cauchy 
sequence in X 2 - 
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.6 For i = 1,2 let Xi,Yi he metric spaces, and fi a mapping of Xi into 
Yi. Prove that the mapping 

{xi,X2) H> (/l(xi),/2(x2)) 

of X\ X X2 into Yi X I 2 is continuous if and only if both /i and /2 
are continuous. 

We have now reached the main result of this section. 

(3.5.10) Proposition. Let T be any one of the following types of metric 
space: complete, totally hounded, compact. Then the product X = XiX X 2 
of two nonempty metric spaces Xi and X 2 is of type T if and only if both 
Xi and X 2 are of type T. 

Proof. Leaving the necessity of the stated conditions as an exercise, we 
prove their sufficiency. To this end, assume that Xi and X 2 are com- 
plete, and consider a Cauchy sequence (x„) in X. By Exercise (3.5.9: 5), 
(pr^(a;„))^;^ is a Cauchy sequence in X^; since X^ is complete, 

fk = lim prk(xn) 

n—¥oo 

exists. Reference to Exercise (3.5.9: 4) shows that (x„) converges to the 
point (Cl, ^ 2 ) of X. Hence X is complete. 

It is easy to see that if £ > 0 and Fk is a finite e-approximation to Xk, 
then Fi x F 2 is a finite e-approximation to X. It follows that if Xi and X 2 
are totally bounded, so is X. 

The first two parts of the proof, and Theorem (3.3.9), show that if X\ 
and X 2 are compact, then so is X. □ 

(3.5.11) Exercises 

.1 Prove that the product of two discrete metric spaces is discrete. 

.2 In the notation of Proposition (3.5.10), prove that if X is of type T , 
then so are X\ and X 2 . 

.3 Prove that X is separable if and only if both X\ and X 2 are separable. 

.4 Prove that X is locally compact if and only if both X-y and X 2 are 
locally compact. 

.5 Prove that X is connected (respectively, path connected) if and only 
if both X\ and X 2 are connected (respectively, path connected). 

.6 Prove that a subset of the product space or is compact if and 
only if it is closed and bounded. 
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.7 Prove that the Euclidean spaces and are complete. 

.8 Show that in the product space the set 

X = ({0}x [0,l])U([0,l]x{0}) 

is compact, that every ball in X is connected, but that the closure 
of an open ball in X need not be the corresponding closed ball (cf. 
Exercise (3.4.10:3)). 

We define the product of a finite family {Xi, pf ), . . . , (X„, p„) of metric 
spaces to be the metric space (X,p), where 

a: = X • • • X a:„ 

and 

p{{xi, . . {yi , . . .,?/„)) = max{p,{xi,yi) : i = 1, . . . ,n}. 

The results proved so far in this section extend in the obvious ways to 
a product of more than two, but finitely many, metric spaces. The final 
set of exercises in this chapter shows how we can handle the product of a 
sequence of metric spaces. 

(3.5.12) Exercises 

.1 Let {{Xn, Pn))^i be a sequence of nonempty metric spaces such that 
diam(X„) < 1 for each n. Let X be the set of all sequences {xn)'ff=i 
such that Xn G Xn for each n, and define a mapping p : X x X ^ H 

by 

oo 

P ((^rt)i ivn)) ~ ^ ' 2 Pn{Xn,yn)- 

n—1 

Prove that p is a metric on X. 

The metric space {X, p) is called the product of the sequence (X„) of 
metric spaces and is usually denoted by n: — 1 • 

The next four exercises use the notation of Exercise (3.5.12: 1). 

.2 Let X = {xn)'i^=i be a point of X. Prove that C/ C AT is a neighbour- 
hood of cc in Ai if and only if for some positive integer m and some 
r > 0, U contains a set of the form 

Um{x,r) = {{yn)fff=i G X : Pi{xi,yi) < r for 1 < f < m} . 

.3 For each n let be a subset of Ai„. Prove that the closure of 

nr=l in ^ is f[n=l 
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.4 For each k let Xk = {xk,n)'^=i be a point of X. Prove that the sequence 
i^k)T=i converges in X to a limit a = if and only if for each n 

the sequence {xk,n)^^i converges to a„ in X„. Prove also that 
is a Cauchy sequence in X if and only if for each n the sequence 
{xk,n)'^i is a Cauchy sequence in 

.5 With T as in Proposition (3.5.10), prove that X is of type T if and 
only if Xn is of type T for each n. 

.6 Let {Xn)'^^i be a sequence of discrete metric spaces, each having 
positive diameter < 1. Prove that the product space is not 

discrete. 




4 

Analysis in Normed Linear Spaces 



./ could be bounded in a nutshell, and 
count myself a king of infinite space... 

HAMLET, Act 2, Scene 2 



Many significant applications of analysis are the fruit of cross-fertilisation be- 
tween metric structure and algebraic structure. In this chapter we discuss such 
a cross-breed: a normed (linear) space. Section 1 introduces these objects and 
deals with their elementary analytic and geometric properties. In Section 2 we 
discuss linear mappings between normed spaces, paying particular attention to 
bounded linear functionals — continuous linear mappings into R and C. Although 
many of the most important normed spaces of analysis are infinite-dimensional, 
finite-dimensional ones remain significant in many ways; they are dealt with in 
Section 3. The next two sections deal with two fundamental classes of infinite- 
dimensional complete normed spaces: the Lp integration spaces and the space 
C{X) of continuous functions from a compact metric space X into R. They also 
characterise the associated bounded linear functionals. Two of the most impor- 
tant results about C{X) — Ascoli’s Theorem and the Stone-Weierstrass Theo- 
rem (a far-reaching generalisation of the classical Weierstrass Approximation 
Theorem) — are proved in Sections 5 and 6. Both of these theorems reappear in 
the final section of the chapter, where they are applied to the concrete classical 
problem of solving ordinary differential equations. 
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4.1 Normed Linear Spaces 

Metric spaces offer one context within which the analytic and topological 
properties of R can be generalised, but they do not provide a natural frame- 
work for a generalisation of the algebraic properties of R. A framework of 
the latter sort is made available by the notion of a normed linear space. 

Let F stand for either R or C, and let X he a linear space (vector space) 
over F. A norm on X is a mapping x i— ||a;|l of X into R such that the 
following properties hold for all x,y G X and A G F. 

N1 |p||>0. 

N2 ||a;|| = 0 if and only if x = 0. 

N3 ||Ax|| = |A| INI. 

N4 II X + y\\ < ||x|| -I- IIj/II {triangle inequality). 

A normed linear space, or normed space, over F is a pair {X, H-H) consisting 
of a linear space X over F and a norm H-H on X; by abuse of language, we 
refer to the linear space X itself as a normed space if it is clear from the 
context which norm is under consideration. We say that the normed space 
X is real or complex, depending on whether F is R or C. A vector with 
norm 1 is called a unit vector. 

The simplest example of a norm is, of course, the mapping x i— >■ |x| on 

F. 

If X is a normed space, then the mapping {x,y) !->■ ||x — i/|| of X x X into 
R is a metric on X (Exercise (4.1.1: 1)), and is said to be associated with 
the norm on X. When we consider X as a metric space, it is understood 
that we are referring to the metric associated with the given norm on X. 
By the unit ball of X we mean the closed ball with centre 0 and radius 1, 

;B(0,1) = {xGX: Nil < 1}, 

relative to the metric associated with the norm on X. 

(4.1.1) Exercises 

.1 Prove that p{x,y) = ||x — y\\ defines a metric on a normed space X, 
such that 



p{x + z, y + z) = p{x,y), 
p{\x, \y) = |A| p{x,y) 



for all x,y,z G X and A G F. 
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.2 Show that 

llkll-llylll <lk-2/ll 

for all vectors a;, y in a normed space X. Hence prove that if a sequence 
{xn) converges to a limit x in X, then ||a;|| = lim„_>oo ||a;„|| . 

.3 Prove that 

Ikll = inf t G F,t ^ 0, ||te|| < l| 

for each element cc of a normed space X. 

.4 Prove that for each positive integer n the mappings 
(xi, . . . ,a;„) max{|a;i| , . . . , |a;„|} , 

{xi,...,Xn) \Jx\^ hx2, 

{xi ,. . . ,a;„) \xi\ H h |x„| 



are norms on F". In the case F = R the second of these norms is 
called the Euclidean norm on R”, and the associated metric is the 
Euclidean metric (see Exercise (3.1.1: 5)). 

.5 Let X be a nonempty set, and denote by B{X, F) the set of all 
bounded mappings of X into F, taken with the pointwise operations 
of addition and multiplication-by-scalars: 

if + g) (x) = f{x)+g{x), 

(A/) (x) = Xf{x). 

The supremum norm, or sup norm, on B{X, F) is defined by 
ll/ll =sup{|/(a;)| :xeX}. 

Verify that the sup norm is a norm on B{X, F). 

.6 Prove that ||/||;^ = / |/| defines a norm on the set Li(R) of all 
Lebesgue integrable functions (defined almost everywhere) on R, 
where two such functions are taken as equal if and only if they are 
equal almost everywhere. 

.7 Let Xi,X 2 be normed spaces over F, and recall that the standard 
operations of addition and multiplication-by-scalars on the product 
vector space X = Xi x X 2 are given by 

(X 1 ,X 2 ) + (x'i,X 2 ') = (Xi + x{, X 2 + x' 2 ) , 

X (xi,X2) = (Xxi,Xx2) ■ 
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Verify that the mapping (xi,X2) >— l max{||a;i|| , ||x2||} is a norm on 
X, and that the metric associated with this norm is the product 
metric on X (considered as the product of the metric spaces Xi and 
X2). 

Taken with this norm, which we call the product norm, X is known as 
the product of the normed spaces Xi and X2. The product norm and 
the product space for a finite number of normed spaces are defined 
analogously. 



( 4 . 1 . 2 ) Proposition. Let X be a normed space over F. Then 

(i) the mapping (x,y) x + y is uniformly continuous on X x X; 

(ii) for each A G F the mapping x \x is uniformly continuous on X; 

(iii) for each x € X the mapping \x is uniformly continuous on F; 

(iv) the mapping (A, x) 1— Aa; is continuous on F x V. 

Proof. The uniform continuity of the first three mappings follows from 
the relations 



\\(x + y)-ix' + y')\\<\\x-x'\\ + \\y-y'\\, 
llAx- Ay|| = |A| ||a;-2/||, 

I|A^-A'x|| = |A-A'| ||x||. 

On the other hand, the relations 

||Ax - Aoa^oll = ||Ao(a; - xq) + (A - Ao)xq + (A - Ao)(x - xo)|| 

< |Ao| ||x - xoll + |A - Ao| ||xo|| + |A - Aq| ||x - xo|| 

easily lead to the continuity of (A,x) 1 — Xx at (Ao,xo). □ 

If AT is a normed space and S' is a linear subset of X , then the restriction 
to S of the norm on AT is a norm on S; taken with this norm, S is called a 
normed linear subspace, or simply a subspace, of the normed space X. 

( 4 . 1 . 3 ) Proposition. If S is a subspace of a normed space X, then the 
closure of S in X is a subspace of X. 

Proof. Let / be the mapping (x,y) 1-^ x + y of X x X into X. As S is a 
subspace, / maps S x S into S, so 



s X s c r\s) c r\s), 
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and therefore S' x S' is a subset of the closure of /“^(S). Since, by Propo- 
sition (4.1.2), / is continuous on X x X, it follows from Proposition 
(3.2.2) that /“^(S) is a closed subset of X\ whence S x S C /“^(S). But 
S X S = Sx S, by Proposition (3.5.5); so if a; G S and y € S, then x+y G S. 
A similar argument, using the continuity of the mapping (A, x) Xx, shows 
that if A G F and x G S, then Ax G S. □ 

(4.1.4) Lemma. If S is a closed subspace of a normed space X, and 
a € X, then 

a + S = {a + x : X € S} 

is closed in X. 

Proof. Let / be the mapping z z — a of X into itself. Since / is the 
composition of the mappings z !-->■ (z, —a) and (x, y) x + y, it follows 
from Exercise (3.5.9: 1), Proposition (4.1.2), and Proposition (3.2.3) that / 
is continuous on X. But a -I- S' = /“^(S); so, by Proposition (3.2.2), a -I- S 
is closed in A. □ 

(4.1.5) Exercises 

.1 Explain why, in Proposition (4.1.2), the mapping (A,x) i— Ax is not 
uniformly continuous on F x A. 

.2 Complete the proof of Proposition (4.1.3). 

.3 Let (A,p) be a metric space, and A a normed space such that 

(i) P(x, y) = \\x - y\\ for all x,y G X; 

(ii) A is dense in A. 

Show that the operations of addition and multiplication-by-scalars 
can be extended uniquely to make A a normed space with associated 
metric the given metric p. (Use Propositions (4.1.2) and (3.2.12).) 

.4 Let A and B be nonempty subsets of a normed space A, and define 

A + B = {x + y ■. x G A, y G B} . 



Prove that 

(i) if A is open, then A + B is open; 

(ii) if A is compact and B is closed, then A + B is closed. 



Need A -I- i? be closed when A and B are closed? 
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.5 Recall that a subset C of a linear space is said to be convex if tx + 
(1 — t)y G C whenever x,y G C and 0 < t < 1. Prove that the closure 
of a convex subset of a normed space is convex. 

.6 Let C be a nonempty closed convex subset of a normed space X, xg a 
point of X\C, and r a positive number such that Cr\B{xg,r) is empty. 
Prove that C + B{0, r) is open and convex, and that Xg ^ C + B(0, r). 

.7 Let C be a nonempty convex subset of R”, and xg a point of C\C. 
Prove that each open ball with centre Xg intersects the complement 
of C. (First consider the case where C has a nonempty interior.) Does 
the conclusion hold if we drop the hypothesis that C is convex? 

In a tribute to one of the founders of functional analysis, the Polish 
mathematician Stefan Banach (1892-1945), a complete normed linear space 
is called a Banach space. Among examples of Banach spaces are 

• Euclidean n-space R" (Exercise (4.1.1: 4)); 

• B{X,F) where Ai is a nonempty set and the norm is the sup norm 
(Exercise (4.1.1: 5)); 

• certain spaces of continuous or integrable functions that we consider 
in later sections of this chapter. 



(4.1.6) Exercises 

.1 Let cg be the real vector space (with termwise algebraic operations) 
consisting of all infinite sequences x = (x„)'^^i in R that converge to 
0. For each x G cg write 

||a;|| = sup \x„\. 

n>l 

Prove that this defines a norm on cg with respect to which cg is 
a separable Banach space. (For the second part, consider a Cauchy 
sequence (a:fc)^i in cg, where for each k, Xk = {xk,n)’^^i- Show that 
for each n, {xk,n)’k=i ^ Cauchy sequence in R. Denoting its limit by 
show that (Cn))))Li belongs to cg and is the limit of the sequence 
{xk)^^i in the space cq .) 

.2 Let 1 1 denote the space of all sequences of real numbers such that 
the corresponding series is absolutely convergent, and for each x = 
{xn)n=i G h write 

OO 

n— 1 

Prove that this defines a norm on l\ with respect to which li is a 
separable Banach space. 
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.3 Let loo denote the space of all bounded sequences of real numbers, 
and for each x = (xn)^=i G loo write 

lla^lloo =sup|a;„|. 

n>l 

Prove that this defines a norm on loo with respect to which loo is a 
Banach space. 

.4 Prove that if X is a nonempty set, then B{X,F), taken with the 
supremum norm, is a Banach space. 

We now sketch how any normed space X can be embedded as a dense 
subspace of a Banach space. Defining 

<l)x{y) = \\x - y\\ {x,y G X), 

Y = {<l,o + f:fGB{X,Il)}, 
d{F,G) = sup {\F{x) - G{x) \ :xGX} (F,GgF), 

we recall from Exercise (3.2.10:8) that {Y,d) is a complete metric space, 
that X 1-^ (j)x is an isometric mapping of X onto a subset Z of Y, and that 
the closure X of Z is a complete subspace of Y. We transport the algebraic 
structure from X to Z by defining 



4^x F 4^y — 4^x+yj 
^4^x ~ 4^Xx 



for all x,y G X and A G F. Then 

HxW = d{(j)x,(j)o) = ||a:|| 

defines a norm on Z whose associated metric is the one induced by d. 
Using Exercise (4.1.5: 3), we can extend the operations of addition and 
multiplication-by-scalars uniquely from X (identified with its image under 
the mapping a; i— </>x) to X, thereby making X a Banach space in which 
there is a dense linear subspace isometric and algebraically isomorphic to 
X. 

In practice, we normally forget about the mapping x ^ 4>x and regard 
X simply as a dense subspace of X, which we call the completion of the 
normed space X. 

(4.1.7) Exercise 

Fill in the details of the proof that the foregoing constructions provide 
X with the structure of a Banach space and that x (px is & norm- 
preserving algebraic isomorphism of X with a dense subspace of X. 




180 



4. Analysis in Normed Linear Spaces 



Banach spaces form the natural abstract context for the notion of con- 
vergence of series. Given a sequence (a;„) of elements of a normed space X, 
we define the corresponding series to be the sequence (s„), where 

Sn = is the nth partial sum of the series. The series i® 

said to be 

• convergent if the sequence (s„) converges to a limit s in AT, called the 
sum of the series, 

• absolutely convergent if the series ll^^nll i® convergent in R, 

• unconditionally convergent if f{n) converges for each permu- 

tation / of N'*" . 

In the first case we write = s. 

(4.1.8) Exercises 

.1 Prove that a series ^ Banach space X converges if and 

only if for each e > 0 there exists a positive integer N such that 
II Y.n=r +1 ^ri\\ < £ whenever j > i> N. 

.2 Prove that an absolutely convergent series in a Banach space is 
unconditionally convergent. (See Exercise (1.2.17: 1).) 

.3 Let A" be a normed linear space, and suppose that each absolutely 
convergent series in X is convergent. Prove that X is a Banach space. 
(Given a Gauchy sequence (x„) in X, choose n\ < U 2 < ■ ■ ■ such 
that ||a;i — ccjll < 2“^ for all i,j > Uk- Then consider the series 

i^rik+i ~ ^rik) •) 

.4 In the Banach space cq of Exercise (4.1.6: 1), for each positive integer 
n let Xn be the element with nth term 1/n and all other terms 0. 
Prove that the series is unconditionally convergent but not 

absolutely convergent. 



Exercises (1.2.17: 1 and 2) show that a series in R is unconditionally 
convergent if and only if it is absolutely convergent. Exercise (4.1.8: 4) 
shows that this need not be true if R is replaced by an infinite-dimensional 
Banach space. In fact, if every unconditionally convergent series in a Banach 
space X is absolutely convergent, then X is finite-dimensional; this is the 
Dvoretsky-Rogers Theorem (see [12], Ghapter VI). 

Let AT be a normed space over F, and S a linear subspace of X. Then 



a; ~ y if and only ii x — y € S 
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defines an equivalence relation on X. The set of equivalence classes under 
this relation is written X/ S and is called the quotient space of X by S. The 
canonical map ip ai X onto X/S is defined by 

tf{x) = {a; + s : s e S'} , 

and maps each element of X to its equivalence class under We define 
operations of addition and multiplication-by-scalars on X/S by 

ip{x) + ip(ij) = p{x + y), 
ip{\x) = \lp{x). 

These definitions are sound: for ii x ^ x' and y ^ y', then x + y ^ x' + y' 
and Xx ~ Xx'. If S is a closed linear subspace of X, then 

||(^(a;)|| = p{x,S) = inf lllx- s|| : s G S| 

defines a norm, called the quotient norm, on X/S. In that case we assume 
that X/S is equipped with the foregoing algebraic operations and with the 
quotient norm. 

(4.1.9) Exercises 

.1 Verify the claims made without proof in the preceding paragraph. 

.2 Prove that if S is closed in X, then the canonical map p : X ^ X/S 
is uniformly continuous on S. 

(4.1.10) Proposition. If S is a closed linear subspace of a Banach space 
X, then the quotient space X/S is a Banach space. 

Proof. Let p be the canonical map of X onto X/S, and consider a se- 
quence (xn) in X such that (p(xn)) is a Cauchy sequence in X/S. Choose 
a strictly increasing sequence of positive integers such that 

lk(a;n,+i) - <p(x„J|| < 2"'= (/c>l). 

Setting Si = 0, we construct inductively a sequence (sk) in S such that for 
each k, 

||(2^rifc + i ~ Sfc+l) — (x„j, — Sfc)|| < 2 . (1) 

Indeed, having constructed elements si,...,Sfc of S with the applicable 
properties, we have 

inf - (xn^ - Sk) - s|| : s G S'} 

= inf - Xn^ - s|| : s G S} 

= ||9^(2:nfc + i) ~ 'P(2^nfc)|| < 2 , 
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SO there exists Sfc+i G S such that (1) holds. 

We now see from (1) that (x„j, — is a Cauchy sequence in the 

Banach space X; whence it converges to a limit z in X. By Exercise 
(4.1.9: 2), 

(p(Xnk) = V(Xnk) - V(sk) = ‘fiiXnk ~ Sfc) (fi{z) aS A OO. 

Thus the Cauchy sequence {(p{xn)) has a convergent subsequence. It follows 
from Exercise (3.2.10: 3) that {(p{xn)) itself converges in X\S. □ 



4.2 Linear Mappings and Hyperplanes 

In the context of normed spaces, the important mappings are not just 
continuous but also preserve the algebraic structure. 

Recall that a mapping u between vector spaces X, Y is linear if 

u{x + y) = u(x) + u(y) 



and 



u(Ax) = Au(x) 



whenever x,y G X and A G F. If E = F, then u is called a linear functional 
on X . Examples of linear mappings are 



• the mapping x i — Ax on F”, where A is an n-by-n matrix over F; 

• the Lebesgue integral, regarded as a mapping of Li(R) into R (see 
Exercise (4.1.1: 6)); 

• the mapping {xn)l^=i '->■ xi of cq into R (see Exercise (4.1.6: 1)); 

• the canonical mapping of a normed space X onto the quotient space 
X/S, where S' is a closed subspace of X; 

• the mapping x (f>x of a, normed space onto a dense subspace of its 
completion (see page 179). 



Here is the fundamental result about the continuity of linear mappings 
between normed spaces. 



(4.2.1) Theorem. The following are equivalent conditions on a linear 
mapping of a normed space X into a normed space Y. 

(i) u is continuous at 0. 

(ii) u is continuous on X. 

(iii) u is uniformly continuous on X. 
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(iv) u is hounded on the unit ball of X. 

(v) u is hounded on each bounded subset of X. 

(vi) There exists a positive number c, called a bound for u, such that 
||'w(x)|| < c ||a;|| for all x G X. 

Proof. Suppose that u is continuous at 0. Then there exists r > 0 such 
that 

IKx)|| = |Kx)-u(0)||<i 

whenever ||a;|| < r. For each nonzero t G F with ||ta;|| < 1 we have ||rte|| < r 
and therefore 



||u(a:)|| = r ^ |t| ^ \\u{rtx)\\ <r ^ \t\ \ 



It follows from Exercise (4.1.1: 3) that ||u(a:)|| < ||a;|| for all x G X. 

Hence (i) implies (vi). 

It is clear that (vi) (v) (iv). Next, suppose that there exists c > 0 

such that ||M(a:)|| < c whenever ||a;|| < 1. Since 



||u(a:) II = 11x11 u(||x|| ^ x) <c||x|| (x yf 0) 



and u(0) = 0, we see that (vi) holds, with c a bound for u. We now have 



llu(x-y)|| < c||x-y|| {x,y G X) , 



from which it follows that u is uniformly continuous on X. Thus (iv) 
(vi) ^ (hi). 

Finally, it is obvious that (iii) (ii) (i). □ 



In view of property (v) of Proposition (4.2.1), we commonly refer to a 
continuous linear mapping between normed spaces X,Y as a bounded linear 
mapping on X. We define the norm of such a mapping by 

||u|| = sup{||m(x)|| : X G X, ||x|| < 1} . (1) 

The argument used to prove that (iv) (vi) in the last proof shows that 
||u(x)|l < ||m|| ||x|| {xGX). 

In Exercise (4.2.2: 11) you will prove that equation (1) defines a norm on 
the linear space L{X, Y) of all bounded linear mappings u : X ^ Y, and 
that if F is a Banach space, then so is L{X, Y). The Banach space L{X, F), 
consisting of all bounded linear functionals from X into its ground field F, 
is called the dual space, or simply the dual, of X, and is denoted by X*. The 
interplay between a Banach space and its dual is one of the most significant 
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themes of functional analysis, so we spend some time later in this chapter 
and in Chapter 6 identifying the duals of certain important Banach spaces. 

Two norms H-H , |l-|f on a vector space X are said to be equivalent if 
both the identity mapping from {X, H-H) onto {X, ||-|f) and its inverse are 
continuous; since those mappings are linear, it follows from Proposition 

(4.2.1) that II ’ll and ||-|f are equivalent norms on X if and only if there 
exist positive constants a, b such that a ||a;|| < ||a;|f < b ||a:|| for all x G X. 

(4.2.2) Exercises 

.1 Prove that a linear mapping u : X ^ Y between normed spaces is 
bounded if and only if there exists c > 0 such that ||u(a:)|| < c for all 
X G X with ||a;|| = 1, and that we then have 

||u|| = sup{||u(x)|| :xGX, ||x|| = 1}. 

.2 Let u be a bounded linear mapping on a normed space X. Prove that 
||m|| = inf {c > 0 : ||u(x)|| < c ||a;|| for all x G X} . 

.3 Show that any two of the three norms on R" introduced in Exercise 
(4.1.1: 4) are equivalent. 

.4 Let II ’ll , ll’lf be equivalent norms on a linear space X. Prove that if 
X is complete with respect to H-H , then it is complete with respect 

to i|.ir. 

.5 Let Xi, . . . , Xn be normed spaces, X = Xi x ■ ■ ■ x X^, and Y a 
normed space. Let m be a multilinear mapping of X into Y — that is, 
a mapping linear in each of its n variables. Prove that u is continuous 
if and only if there exists a constant c > 0 (which we call a bound for 
u) such that 

||u(a;i, . . . ,x„)|| < c||xi|| ||a; 2 || ••• ||x„|| 
for all (xi, . . . , Xn) G X. 

.6 Let X, Y be normed spaces, and u : X ^ Y a linear mapping such 
that for each sequence (xn) in X converging to 0, the sequence (u{xn)) 
is bounded in Y. Prove that u is continuous. (Let (a;„) be a sequence 
converging to 0 in X, reduce to the case where ||x„|| < 1/n^ for each 
n, and then consider the sequence (nx„).) 

.7 Prove that a linear mapping u : X ^ Y between normed spaces is 
bounded if and only if for each Cauchy sequence (a;„) in X, (u(xn)) 
is a Cauchy sequence in Y. 




4.2 Linear Mappings and Hyperplanes 185 



8 Let u be a continuous linear mapping of a normed space X into a 

Banach space Y, and let be an absolutely convergent series 

in X. Prove that the series 'w(a^n) converges absolutely in Y. 

9 Recalling the Banach space l\ of Exercise (4.1.6: 2), for each n let 
be the element of l\ with nth term 1 and all other terms 0. Show 
that to each bounded sequence (a;„) in a Banach space X there cor- 
responds a unique bounded linear mapping u : h ^ X such that 
u(e„) = Xn for each n. 

Now let X be a separable Banach space, and (x„) a dense sequence in 
the unit ball B of X. Define the bounded linear mapping u : l\ ^ X 
as previously. Prove that u maps h onto X. (Given x G B, construct 
inductively rii < U 2 < ■ ■ ■ such that 



'ik— 1 



(x-x„J-^2 



k-j r 



1=2 



< 2 



— k 



for each k.) 

Thus every separable Banach space is the range of a bounded linear 
mapping on li. For further results of this type see [12]. 

10 Let D be a dense linear subspace of a normed space X, and u a 
bounded linear mapping from D into a normed space Y. Prove that 
u extends to a bounded linear mapping, with the same norm, from 
X into Y. (First use Proposition (3.2.12).) 

11 Prove that if X, Y are normed spaces, then 



ll'ull = sup{||'u(a:)|| : x G X, ||a;|| < 1} 

defines a norm on L{X, F), and that if Y is complete, then L{X, Y) 
is a Banach space with respect to this norm. (To establish the com- 
pleteness, let (un) be a Cauchy sequence in L{X,Y), and show 
that 

u{x) = lim Un{x) 

n—^oo 

defines an element u of L{X, Y) such that ||u — ■u„|| — 1 0 as n — 1 oo.) 

12 Let Co be the Banach space of Exercise (4.1.6: 1). For each positive 
integer n let e„ be the sequence whose nth term is 1 and which has 
all other terms equal to 0. Let n be a bounded linear functional on cq. 
Prove that the series ^(^n) is absolutely convergent, and that 

the norm of u is W(sn)\ - Conversely, prove that if 

absolutely convergent series of real numbers, then there is a unique 
bounded linear functional u on cq such that u(e„) = for each n. 
Describe u{x), where x = {xn)'^=i G cq. 
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This example shows that the dual space Cq can be identified with the 
Banach space l\ of Exercise (4.1.6: 2). 

.13 Prove that l\ can be identified with the Banach space loo of Exercise 
(4.1.6: 3). 

.14 Prove the Uniform Boundedness Theorem: let be a family 

of bounded linear mappings from a Banach space X into a normed 
space Y, such that {||Tia;|| : i G /} is bounded for each x £ X] then 
{||Ti|| : i £ 1} is bounded. (Suppose the contrary. Then construct 
sequences {xn)'^=i in X and {in)'^=i in / such that for each n, 

IKII=4-”, 

r.„x„||>i||r,j|||x„||, 

and 

||Ti„|l >3x4” ( n + sup {\\Ti {xi H hx„_i)||} 

V iei 

Taking x = ^n, deduce the contradiction that ||7i„a:|| > n for 

each n. 

This proof was published in [22]. A less elementary, but more stan- 
dard, approach to the Uniform Boundedness Theorem is based on 
Baire’s Theorem (6.3.1) and is discussed in Chapter 6.) 

.15 A normed space X is said to be uniformly convex if it has the following 
property: for each £ > 0 there exists 5 G (0, 1) such that ||a; — y\\ < e 
whenever ||x|| < 1, ||?/|| < 1, and || ^ (a^ + 2/) || > 1 — <5. Prove that if u 
is a bounded linear functional on a uniformly convex Banach space 
X, then there exists a unit vector x £ X such that |■u(x)| = ||w|| . 

Recall that the kernel, or null space, of a linear mapping u : X ^ Y 
between vector spaces is the subspace 

ker(w) = u~^{0) = {x £ X : u{x) = 0} 

of X. We say that u is nonzero if ker(w) ^ X — that is, if there exists x £ X 
such that u{x) yf 0; otherwise, u is said to be zero. 

(4.2.3) Proposition. A linear functional on a normed space X is 
continuous if and only if its kernel is closed in X. 

Proof. Let m be a linear functional on X, and S = ker(rt). As {0} is a 
closed subset of X, Proposition (3.2.2) shows that if u is continuous, then 
S is closed in X. Suppose, conversely, that S is closed in X. Since the zero 
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linear functional is certainly continuous, we may assume that there exists 
a G X such that u{a) = 1. Then 0 ^ a + S'. On the other hand, by Lemma 

(4.1.4) , a + S is closed in X, so its complement is open. Hence there exists 
r > 0 such that x ^ a + S whenever ||x|| < r. Suppose that ||a;|| < r and 
|'u(x)| > 1, and let y = u{x)~^x. Then \\y\\ <r, so y^a+S. On the other 
hand, 

u{y — a) = u{x)~^u{x) — 1 = 0, 
so y — a G S, and therefore 

y = a + {y — a) G a + S. 

This contradiction shows that |M(a;)| < 1 whenever ||a:|| < r. It follows from 
Proposition (4.2.1) that u is continuous. □ 

As we show in a moment, nonzero linear functionals on a normed space 
X are associated with certain subspaces of X which we now define. 

A subspace iL of a vector space X is called a hyperplane if 

— X\H is nonempty, and 

— for each a G X\H and each x G X there exists a unique pair (t, y) G 
F X such that x = ta + y. 

This expression of the element x is called its representation relative to the 
pair {H, a) consisting of the hyperplane H and the element a of X\H. 

(4.2.4) Proposition. The kernel of a nonzero linear functional on a 
normed space X is a hyperplane in X. Conversely, if H is a hyperplane 
in X and a ^ H, then there exists a unique linear functional u on X such 
that ker(u) = H and u(a) = 1. 

Proof. First let u be a nonzero linear functional on X. If a ^ ker(u) and 
X G X, then, using the linearity of u, we easily verify that x = ta + y , 
with t G F and y G ker(u), if and only if t = u{x)/u{a). Hence ker(u) is a 
hyperplane. 

Conversely, let H he & hyperplane in X, and let a ^ H. For each x G X 
there exists a unique pair {t,y) in FxiL such that x = ta + y. Setting 
u{x) = t and f{x) = y, we define functions u : X ^ F and / : A — >• iJ. If 
also x' G X, then 

x + x = {u{x) + u{x')) a + f{x) + f{x'), 

where u{x) +u{x') G F and (as iL is a linear subset of X) f{x) + f{x') G H; 
the uniqueness of the representation of a given element of X relative to 
(H,a) ensures that u{x + x') = u{x) + u{x'). Similar uniqueness arguments 
show that u(Ax) = Au(x) whenever A G F and x G X, and that u{a) = 1. In 
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particular, it follows that u is a linear functional on X. Moreover, u{x) = 0 
if and only if a: = f{x) G H; so ker(M) = H. It remains to prove that u 
is the unique linear functional on X which takes the value 1 at a and has 
kernel H . But if v is another such linear functional on X , then for each 
X G X we have 



v{x) = V (u(x)a + f{x)) 

= u{x)v{a) + v{f{x)) 
= u{x) 1 + 0 
= u{x). □ 



(4.2.5) Exercises 

.1 Let H he & hyperplane in a normed space X, a G X\H, and a G R. 
Prove that there exists a unique linear functional u on X such that 
a + H = {x G X : u(x) = a} . 

.2 Let M be a nonzero bounded linear functional on a normed space X, 
and H — ker(w). Show that p{x, H) = ||u|l~^ |w(a:)| for each x G X. 

.3 A translated hyperplane^ in a normed space AT is a subset of the 
form V + H where iL is a hyperplane in X and v G X. Prove that 
a translated hyperplane is closed if and only if its complement has a 
nonempty interior. 

.4 Let K he & subset of a normed space X, and u a linear functional on 
X. For each a G R the translated hyperplane 

Ha = {x G X u(x) = a} 

is called a hyperplane of support for K if 

— there exists xq G K such that u{xo) = a, and 

— either u{x) > a for all x G K or u{x) < a for all x G K. 

Prove that if K is compact and is not contained in any Ha, then for 
exactly two real numbers a, Ha is a hyperplane of support for K. 
(Consider the set {t G R : u~^{t) fl AT 0} .) 



1 



Some authors use the term “hyperplane” for a translated hyperplane. 




4.3 Finite-Dimensional Normed Spaces 



189 
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Before studying some of the more important infinite-dimensional spaces 
in analysis, we devote a section to the major analytic properties of finite- 
dimensional spaces. We begin by showing that for any positive integer n, 
any normed space of dimension n over F can be identified with the product 
space F". 

(4.3.1) Proposition. If X is an n-dimensional normed space with basis 
{ei, e„}, then 

n 

(Cl) ■ ■ ■ ) Cn) ^ ^ 

i=l 

is a one-one bounded linear mapping of the product space F" onto X with 
a bounded linear inverse. 

Proof. Let / denote the mapping in question. It is easy to verify that / 
is one-one and maps F" onto X, and that both / and f~^ are linear. Let 

c = max ||ed| . 

Ki<n 



The inequalities 






n n 

< m;K ICil 

Kt<n 

2=1 2=1 



show that / is bounded and therefore continuous. Let 

^=|(Ci,---,Cn)GF": max |C*| = • 

I l<2<n I 

Then S is closed (Exercise (4.3.2: 1)) and bounded, and is therefore compact 
(see Exercise (3.5.11:6)). Now, the mapping f >->• ||/(C)|| is continuous and 
(as {ci, . . . , e„} is a basis) maps S into R’*'; so, by Exercise (3.3.7: 2), 



0<r = inf{||/(?)||:^G5}. 

If f is any nonzero element of F", then, setting rj = ||CI|~^ C> have p G S 
and therefore 

r-<ii/(r/)ii = iicir' 11/(011- 

Hence ||C|| < r~^ ||/(0ll • Since this holds trivially when ^ = 0, we see that 
r~^ is a bound for the linear mapping f~^. □ 



(4.3.2) Exercises 

.1 Prove that the set S in the preceding proof is closed. 
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.2 Show that if X is n-dimensional with basis {ei,...,e„}, then the 
mapping 

n 







is a norm on X, and that X is complete with respect to this norm. 



.3 Find an alternative proof that the mapping / ^ in the proof of 
Proposition (4.3.1) is continuous. 



.4 Prove that any linear mapping from a finite-dimensional normed 
space into a normed space is bounded. Hence prove that any two 
norms on a given finite-dimensional linear space are equivalent. 



(4.3.3) Proposition. A finite- dimensional normed space is complete. 

Proof. Let AT be a finite-dimensional normed space. We may assume that 
X yf {0}, SO that X has a basis {ei, . . . ,e„}. Let / be the mapping in Propo- 
sition (4.3.1), and let (x„) be a Cauchy sequence in X. Then (f~^{xn)) is 
a Cauchy sequence in F" and therefore (see Exercise (3.5.11: 7)), converges 
to a limit y G F". Since / is continuous, (x„) converges to f{y) £ X. □ 

(4.3.4) Corollary. A finite-dimensional subspace of a normed space X 
is closed in X. 

Proof. This is an immediate consequence of Propositions (4.3.3) and 
(3.2.9). □ 

Our next result is surprisingly useful. We use it to simplify the proof of 
Theorem (4.3.6). 

(4.3.5) Riesz’s Lemma. Let S he a closed subspace with a nonempty 
complement in a normed space X, and let Q < 9 < 1. Then there exists a 
unit vector x £ X such that \\x — y\\> 9 for each y £ S. 

Proof. Fix xq £ X\S. By Exercise (3.1.10: 3), 

0 < r = p(xo, S) < 9~^r. 

Choosing sq £ S such that 

r < ||xo - Soil < 



let 



X = ||a;o - Soli ^ (xq - sq). 
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Then ||a;|| = 1. Also, for each s G S, 

So + Iko - Soil s G S', 



so 



Iko - Soli ||x - s|| = llxo - (so + llxo - Soil s)|| > p{xo, S) = r, 



and therefore 



X - s > 



||xo - Soil 



> e. 



□ 



It follows from Riesz’s Lemma that in an infinite-dimensional normed 
space A, if 0 < 0 < 1, then there exists a sequence (x„) of unit vectors such 
that \\xm — x„|| > 9 whenever m ^ n (see Exercise (4.3.7: 4)). This result 
can be improved in various ways. For example, in Chapter 6 we prove that 
in any infinite-dimensional normed space there exists a sequence (x„) of 
unit vectors such that \\xm — x„|| > 1 whenever m yf n. A much deeper 
result, due to Elton and Odell, says that if X is an infinite-dimensional 
normed space, then there exist e > 0 and a sequence (x„) of unit vectors 
in X such that ||xm — x„|| > 1 + £ whenever m ^ n; see Chapter XIV of 
[ 12 ]. 

We now use Riesz’s Lemma to provide a topological characterisation of 
finite-dimensional normed spaces. 

(4.3.6) Theorem. A normed space is finite-dimensional if and only if 
its unit ball is totally bounded, in which case that ball is compact. 

Proof. For simplicity, we take the case F = R. Let A be a normed space, 
and B its (closed) unit ball; we may assume that A {0}. Suppose that 
A is finite-dimensional with basis {ci, . . . ,e„}, and let u be the one-one 
linear mapping (Ci)---)Cn) of ^ onto the product metric 

space R". By Propositions (4.3.1) and (4.2.1), there exists R > 0 such that 
if II Er=i?»ei|| < 1, then 



ll(Ci,---,?n)ll = l^il < i?. (1) 

l<t<n 

By the Heine-Borel-Lebesgue Theorem (1.4.6) and Proposition (3.5.10), 
[— is a compact subset of R"; it follows from Propositions (4.3.1) 
and (3.3.6) that ■u“^([— i?,i?j”) is a compact subset of A. Since B is closed 
and, by (1), a subset of u“^([— i?,i?]”), we see from Proposition (3.3.4) that 
B is compact. 

Assume, conversely, that B is totally bounded. Construct a finite ^- 
approximation F to B , let S be the finite-dimensional subspace of A gen- 
erated by F, and suppose that A y^ S'. Then, by Riesz’s Lemma (4.3.5), 
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there exists a unit vector x G X such that \\x — s|| > ^ for all s £ S] but 
this is absurd, as \\x — s|| < ^ for some s £ F. Hence, in fact, X = S. □ 

(4.3.7) Exercises 

.1 Show that if a normed space X contains a totally bounded ball, then 
every closed ball in X is compact. 

.2 Prove that a normed space X is finite-dimensional if and only if 
{x £ X : ||a;|| = 1} is compact. 

.3 Prove that a normed space is locally compact if and only if it is 
finite-dimensional. 

.4 Let X be an infinite-dimensional normed space. Use Riesz’s Lemma 
to construct, inductively, a sequence (xn) of unit vectors in X such 
that for each n, 

(i) xi, . . . ,x„ are linearly independent, and 

(ii) p{xn+i,X„) > i, where = span {xi, . . . , Xn} ■ 

Hence prove that the unit ball of X is not compact. 

This provides us with another proof that if the unit ball of a normed 
space is compact, then the space is finite-dimensional. 

.5 Let X he a metric space, x £ X, and S a nonempty subset of X. 
A point b £ S is called a best approximation, or a closest point, 
to x in S' if p{x,b) = p{x,S). Prove the Fundamental Theorem of 
Approximation Theory, if AT is a finite-dimensional subspace of a 
normed space X, then each point of X has a best approximation in 
X. (See [10], [38], or [52] for further information about approximation 
theory, a major branch of analysis with many important practical 
applications.) 

.6 Prove that any hyperplane in a finite-dimensional normed space is 
closed. 

Now let X be the subspace of cq consisting of all sequences {xn)^=i 
of real numbers such that x„ = 0 for all sufficiently large n. Show 
that 

OO 

f{{Xu)n=l) = 

n—1 



defines a linear functional / : AT — >■ R whose kernel is not closed in 
X. 
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.7 Let S' be a nonempty closed subset of R^, and K, B closed balls in 
such that (i) B C K and (ii) K intersects S in a single point C, 
on the boundary of K. If B does not intersect the boundary of K, let 
^ be the centre of B\ otherwise, B must intersect the boundary of K 
in a single point, which we denote by For each positive integer n 
let 

K^ = l{i-0+K. 

Prove that for all sufficiently large n we have B C and fl S = 
0. Hence prove that there exists a ball K' that is concentric with 
K, has radius greater than that of iF, and is disjoint from S. (For 
the first part, begin by showing that there exists a positive integer 
V such that B C for all n > v. Then suppose that for each 
n>v there exists Sn G Kn fl S. Show that there exists a subsequence 
(s„^)^i converging to (, and hence find k such that £ K (1 S, a, 
contradiction.) 



A sequence in a normed linear space X is said to be total if it generates a 
dense linear subspace of X — that is, if the linear space consisting of all finite 
linear combinations of terms of the sequence is dense in A . In that case X 
is separable. To see this, let (a„) be a total sequence in X, and let S be the 
set of all finite linear combinations riOi + • • • + r„a„ with each coefficient 
rk rational. (By a rational complex number we mean a complex number 
whose real and imaginary parts are rational.) If Ai, . . . , A„ are in F, then 
there exist rational elements ri, . . . , r„ of F such that X)fc=i ~ ll^fcll 
is arbitrarily small; since 



n n 

AfeOfc — ^ Tfettfe 
fc=i fc=i 



< E |Afe - Tfcl ||a/c|| 

k=l 



and (a„) is total, it follows that S is dense in X; but S is countable. 
We have the following converse. 



(4.3.8) Proposition. If X is an infinite- dimensional separable normed 
space, then it has a total sequence of linearly independent vectors. 



Proof. Let (a„) be a dense sequence in X, and assume without loss 
of generality that oi 0. We construct inductively a strictly increasing 
sequence 1 = ni < ri 2 < • • • of positive integers such that for each k, 

(i) the vectors , • . • , an,, are linearly independent, and 

(ii) for 1 < TO < rifc, is a linear combination of a„j, . . . , a„^. 

Indeed, if a„j , . • . , a„;. have been constructed with properties (i) and (ii), we 
take Uk+i to be the smallest integer m > Uk such that Um does not belong 
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to the subspace Xk of X generated by {a„i, . . . (If no such integer 

exists, then, being closed by Proposition (4.3.4), Xk contains the closure 
of the subspace of X generated by the dense sequence (a„), so X = Xk 
is finite-dimensional — a contradiction.) It now follows from (ii) that the 
sequence (un^,)^! is total in X. □ 

(4.3.9) Exercises 

.1 Prove that the Banach spaces cq and l± are separable. 

.2 Show that the Banach space loo is not separable. (Consider the set 
of elements of loo whose terms belong to {0, 1} .) 



4.4 The Lp Spaces 

In this section we introduce certain infinite-dimensional Banach spaces of 
integrable functions that appear very frequently in many areas of pure and 
applied mathematics. For convenience, we call real numbers p, q conjugate 
exponents if p > 1, g > 1, and l/p+ ^/q = 1- 
We begin our discussion with an elementary lemma. 

(4.4.1) Lemma. If x,y are positive numbers and 0 < a < 1, then 

^Ciyl-a < 

Proof. Taking u = x/y, consider 

f{u) = — au — 1 + a. 

We have f'{u) = a{u^~^ — 1), which is positive if 0 < u < 1 and negative 
if u > 1. Since /(I) = 0, it follows from Exercise (1.5.4: 7) that f{u) < 0 
for all M > 0. This immediately leads to the desired inequality. □ 

(4.4.2) Proposition. Letp,q he conjugate exponents, and f,g measur- 
able functions on R such that \f\^ and |y|^ are integrable. Then fg is 
integrable, and Holder’s inequality 




holds. 

Proof. We first note that if / |/|^ = 0, then \ ff = 0 almost everywhere; 
so / = 0, and therefore fg = 0, almost everywhere. Then fg is integrable, 
f fg = 0, and (1) holds trivially, as it does also in the case where / |y|^ = 0. 
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Thus we may assume that / \ ff > 0 and / \g\‘^ > 0. We then have, almost 
everywhere, 

l/g| ^ ( \gV 

ij\ffy^^{j\g\y^" v/i/ry v/igiv 

/ i/r , igr 

-p/l/r %/|g|’ 

(where the last step uses Lemma (4.4.1)), so 




Now, fg is measurable and the right-hand side of (2) is integrable. Hence, 
by Proposition (2.3.1), fg is integrable and 




from which (1) follows. □ 

(4.4.3) Proposition. Let p > 1, and let f,g be measurable functions on 
R such that \f\^ and \g\^ are integrable. Then \ f + g\^ is integrable, and 
Minkowski’s inequality 




holds. 

Proof. Clearly, we may assume that p > 1. Now, \ f + gf' is measurable, 
by Exercise (2.3.3: 5). Since 

1/ + pr < (2max{|/| , |p|})P < 2^> (I/r + IpH 

and the last function is integrable, it follows from Proposition (2.3.1) that 
1/ + ^r is integrable. The functions |/| and \f + gf~^ are measurable, by 
Exercise (2.3.3: 5), and 

(\f + gf = \f + gf & LifR). 

Thus, by Proposition (4.4.2), \ f + g\^~^ \ f\ is integrable and 

/i/+.r'm<(/i/+»r)""‘(/i/r)". 
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Similarly, \ f + gf ^ |ff| is integrable and 
It follows that 

J \f + gf = J \f + gf~^\f + 9\ 

< J \f + gf~^\f\ + J \f + gf~^\g\ 

from which we easily obtain Minkowski’s inequality. □ 

(4.4.4) Exercises 
.1 Prove Holder’s inequality 

N 

XnVn 

n—1 

and Minkowski’s inequality 

N / N /AT \ 

n=l / \n=l / \n=l / 

for finite sequences xi, . . . , xn and yi, . . . ,yN of real numbers. 

.2 A sequence (x„) of real numbers is called p-power summable if the 
series \xn\^ converges. Prove that if (x„) is p-power summable 

and (p„) is g-power summable, where p, q are conjugate exponents, 
then 




(i) XnVn is absolutely convergent, and 

(ii) Holder’s inequality holds in the form 



OO 

'^XnPn 

n—1 



< 





Prove also that if (x„) and (y„) are both p-power summable, then so 
is (x„ + Pn ) , and Minkowski’s inequality 



El 

\n—l 



i/p 



< 





i/p 



holds. 
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.3 Let p > I, and let Ip denote the set of all p-power summable se- 
quences, taken with termwise addition and multiplication-by-scalars. 
Prove that 

/ oo \ i/p 

ll(a;n)r=illp = ( 

defines a norm on Ip. (We define the normed space lp{C) of p-power 
summable sequences of complex numbers in the obvious analogous 
way.) 



Let X be a measurable subset of R, and p > 1. We define Lp{X) to be 
the set of all functions /, defined almost everywhere on R, such that / is 
measurable, / vanishes almost everywhere on R\X, and |/|^ is integrable. 
Taken with the pointwise operations of addition and multiplication-by- 
scalars, Lp{X) becomes a linear space. If we follow the usual practice of 
identifying two measurable functions that are equal almost everywhere, 
then 




is a norm, called the Lp-norm, on Lp{X). (We met the normed space Li(R) 
in Exercise (4.1.1: 6).) 

When X = [a,b] is a compact interval, we write Lp[a,b] rather than 

Lp ([a, 6]) . 

(4.4.5) Exercises 

In these exercises, X is a measurable subset of R. 

.1 Let X be integrable and 1 < r < s. Prove the following. 

(i) Ls{X) C Lr{X). (Note that if / G La{X), then \ff G Lg/p(X).) 

(ii) The linear mapping /!—>■/ of Lg{X) into Lp(X) is bounded and 
has norm < p{Xy \ 



.2 Let l<r<t<s< oo, r y s, 

r~^ — 

r~i _ g-i ’ p-i _ g-i 

and / G Lr{X) iT Lg{X). Prove that / G Lt{X) and 

ll/ll. < 11/11“ ll/llf- 



(Consider |/|“‘ |/|^* .) 
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.3 Prove that the step functions that vanish outside X form a dense 
subspace of Lp{X) for p > 1. (First consider the case where X is a 
compact interval.) 

.4 Let p,q be conjugate exponents, and let f,g £ Lp{X). Prove that if 
1 < p < 2, then 

2 (ll/ll^ + ll5ll^)'”' >!!/ + < + 11/ 

and 

11/ + 5llp + 11/ - g\\p > 2 (^11 /lip + llffllp) , 

and that the reverse inequalities hold ifp > 2. {Clarkson’s inequalities. 
Use Exercise (1.5.8: 10).) 

.5 Use the preceding exercise to prove that if p > 1, then Lp{X) is 
uniformly convex. (See Exercise (4.2.2: 15).) 



(4.4.6) The Riesz— Fischer Theorem. Lp{X) is a Banach space for all 
p > 1. More precisely, if {fn) is a Cauchy sequence in Lp{X), then there 
exist f € Lp{X) and a subsequence (/n;,)^i of {fn) such that 

(i) lim„_>oo 11/ - fnWp = 0, and 

(ii) fnk — >■ / almost everywhere on X as k ^ oo. 



Proof. We illustrate the proof with the case X = R and p > 1 . Given a 
Cauchy sequence (/„) in Lp(R), choose a subsequence {fnk)T=i 

ll/m - /nllp < 2 "'= {m,n>nk). 

Then 

||/nib+i ~ fuk lip < 2 

Writing q = p/ (p— 1), we see from Proposition (4.4.2) that for each positive 
integer N, \fnk+i ~ /«* | is integrable over [— TV, iV], and 



J I / »fc+i ^ \\fnk+i fnk\\p(^J 

< 2-'=(2fV)i/«, 



1/9 







SO the series 
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converges. It follows from Lebesgue’s Series Theorem (Exercise (2.2.13:4)) 
that there exists a set ifjv of measure zero such that the series 

OO 

\fnk+l(x) - fndx)\x[_M,N] (^) 

k^l 

converges for all x G II\En, and the function J2T=i |/«fc+i “ f^k \ X[_jv n] 
is integrable. Then 

OO 

E= \J En 

N^l 

is a set of measure zero, and 

OO 

°° k=l 

exists for all x G TV\E. The function / so defined is measurable, by Exercise 
(2.3.3: 4). Since 

Wfuk lip ^ ll/ni lip + ll/rifc ~ fni ||p < ||/ni ||p + 2 

for all k, we see from Fatou’s Lemma (Exercise (2.2.13: 11)) that \ff is 
integrable and hence that / G Lp(R). Moreover, if n > n^, then by applying 
Fatou’s Lemma to the sequence (|/„^ — /n|)^j we see that 

ll/-/«llp = ^lm ll/n.-/n||p<2-b 

Hence lim„^oo 11/ - /nllp = 0. □ 

(4.4.7) Exercises 

.1 Prove the Riesz-Fischer Theorem for a general measurable set X C 
R. Prove it also in the case p = 1. 

.2 Prove that the space Ip is complete for p > 1. 

In order to establish an elegant characterisation of bounded linear func- 
tionals on Lp{X), we first discuss those functions whose derivatives almost 
everywhere belong to Lg(R). 

(4.4.8) Lemma. Let I = [a, 6] he a compact interval, q > 1, and G a 
real-valued function defined almost everywhere on R and vanishing outside 
I. Then the following conditions are equivalent. 

(i) There exists g G Lg(R) such that G'{x) = g{x) almost everywhere. 
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(ii) There exists M > 0 such that 



|G(a:fc+i) — G(xfc)|‘^ 
^ (Xfc+i - Xfc)9-1 



whenever the points Xk & I and x\ < X 2 < ■ ■ ■ < Xn- 
In that case, the smallest such M is J \g\'^ . 



Proof. Writing p = q/(q — 1), suppose that (i) holds, and let a < xi < 
X 2 < ■ ■ ■ < Xn < b. Applying Proposition (4.4.2) to the functions xi and 
Xig, we have 



\G{xk+i) - G{xk)\ 




9 



< 




Hence 



rOOk + 1 

|G(a:fc+i) - G(ccfc)|‘^ < {xk+i - XkY~^ / . 

J Xk 



and therefore 



|G(a;fc+i) — G(a;fc)|® 
^ {Xk+l - XkY~^ 




l5l^< 




Thus (ii) holds, and the smallest M for which (ii) holds is at most f \gf . 

Now suppose that (ii) holds, and let ((afc,&fc))^i be a finite sequence 
of nonoverlapping open subintervals of I. Applying Exercise (4.4.4: 1), we 
obtain 



E |C(W - CMI = E 






HE 






\k—l 



{bk - akY 



-1 



Y^^bk 



^fc=i 



N 



1/p 






\k=l 




1/p 



Hence G is absolutely continuous. It follows from Exercise (2.2.17: 2) that 
there exists an integrable function g such that G'{x) = g{x) almost 
everywhere. 
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For each positive integer n let 

Xn,k = a + ^{b- a) (0 < /c < 2”) 
and define a step function g„ by setting 



9n{x) 



^n,fc+l 



if Xn^k ^ X ^ Xji^li-^i 



0 



otherwise. 



Then g{x) = lim„_>oo ^„(x) almost everywhere — to be precise, on the 
complement of the union of 



{xn,k ■ n>l, 0 < k < 2"} 
and the set of measure zero on which G' = g . Also, 




\G{x„^k+l) - G{Xn,k)\‘^ 

{xn,k+i - 



< M. 



Applying Fatou’s Lemma, we now see that \g\‘‘ is integrable and / 1^1“^ < M. 
Hence (ii) implies (i) . Referring to the last sentence of the first part of the 
proof, we also see that / |g|'^ is the smallest M for which (ii) holds. □ 



(4.4.9) Lemma. Let u he a bounded linear functional on Lp[a, 6], and 
define 



G{x) = 






if a < X < 6 
otherwise. 

-\n— 1 



Let a < x\ < X 2 < ■ ■ ■ < Xn < b, and let f = '^k^i ^kX\ 1 • Then 

n—1 

u{f) ^'^Ck {G{xk-\-i) - G{xk )) . 






Moreover, if there exists g G Lq[a,b] such that G' = g almost everywhere, 
then u{f) = f fg. 



Proof. We have 

n—1 
n— 1 
n—1 

= ^ Cfc (G(xfc+l) - G(xfc)) . 
fc=l 
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Now suppose that G' = g almost everywhere for some g G Lq[a, 6]. Then 



n—l 

^(/) = / 

fc=l 



a^fc+1 



n—l 

= E 



rxk+i 



^kg 



fc=i 




□ 



We now show that if p, q are conjugate exponents, then the dual space 
L* can be identified with Lq. 

(4.4.10) Theorem. Let p, q be conjugate exponents. Then for each g G 

Ug(f) = j fa 

defines a bounded linear functional on Lp{X) with norm equal to ll^Hq. 
Conversely, to each bounded linear functional u on Lp{X) there corresponds 
a unique g G Lg(X) such that u = Ug. 



Proof, li g G Lq{X), then by Lemma (4.4.2), Ug is well defined on Lp{X). 
It is trivial that Ug is linear, and Holder’s inequality shows that ||g||^ is a 
bound for Ug. On the other hand, taking 



we see that / G Lp(X) and 

%(/) = y /ff = y = 

=ii5ii,(y ' 

HMi,(yi/r)" 

= NUI/II,- 




<? 



Hence ||Mg|| = . 

Now consider any bounded linear functional u on Lp(X). To begin with, 
take the case where A1 is a compact interval [a, b]. If u = Ug for some 

g G Lq{X), then u = la a for each x G X. This suggests that we 

define 

u 

G{x) = { 



(x[a..]) ifa^G^ 



0 if X G R\X 



and try to show that G' G Lq{X) and that u{f) = / fG' for all / G Lp{X). 
To this end, let a < xi < X 2 < • • • < < 5. Let be the step function 
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that vanishes outside [xi^Xn] and at each Xi, and that takes the constant 
value 



\G{xk+i) - G{xk)\‘^ ^ sgn {G{xk+i) - G{xk)) 
{Xk+l - Xfc)9-1 



on (xk,Xk+i) , where 



f 1 if x > 0 



sgn (x) = < 



0 if a; = 0 

— 1 if a; < 0. 



Then, by Lemma (4.4.9), 

n—1 

E 



\G{xk+i) - G{xk)\'^ ,,, 

33— = U(0) 



k=l 



{xk+1 - XkY 



< k 



r lip 
^n—1 



i/p 



= Ikll I^E \ck\^ i.^k+1 -Xk)j 

II w ( \^ |G(a;fc+i) - G(xfe)|^ 

= (g («« - X.).-. 



1/p 



and therefore 

|G(a:fc+i) - G(xfc)!'^ , 

Thus, by Lemma (4.4.8), there exists g G Lq{X) such that G' = g almost 
everywhere and |k||^ < ||m|| . It follows from Lemma (4.4.9) that u{f) = 
f fg for each step function / that vanishes outside X. The set of such step 
functions is dense in the space Lp{X), by Exercise (4.4.5: 3); moreover, the 
linear functionals u and f '-G- J fg are bounded, and therefore uniformly 
continuous, on Lp{X). Referring to Proposition (3.2.12), we conclude that 

U = Ug. 

It remains to remove the restriction that X be a compact interval and 
to prove the uniqueness of g for a given u. This is left as an exercise. □ 



(4.4.11) Exercises 

.1 Complete the proof of Theorem (4.4.10) by removing the restriction 
that X be a compact interval, and by proving the uniqueness of the 
function g for a given u. 
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.2 A measurable function / on R is said to be essentially hounded if 
there exists M > 0 such that |/(a;)| < M almost everywhere. Prove 
that 

ll/lloo = 0 ■ l/(^)l — ^ almost everywhere} 

defines a norm on the vector space of all essentially bounded 
functions under pointwise operations, and that Loo is a Banach space 
with respect to this norm (where, as usual, we identify measurable 
functions that are equal almost everywhere). 

The real number \\f\\^ is called the essential supremum of the 
element / of Lao- 

.3 Let AT be a compact subset of R, and / : AT — >■ R a continuous 
function. Extend / to R by setting f{x) = 0 if x G R\AT. Prove that 
f G Lao and that ||/||^ = sup,,^^ |/(x)| . 

.4 Prove that if / G Li and g G Lao, then fg G Li and Holder’s 
inequality holds in the form 

ll/5lll<ll/lllllfflloo- 

.5 Prove that for each g G Lao, 

ugif) = j fa 

defines a bounded linear functional on L\ with norm equal to ||(/|loo > 
and that every bounded linear functional on Li has the form Ug for 
a unique corresponding g G Lao- 

.6 Let 0 < p < 1, and let Lp consist of all measurable functions / on 
R such that \ ff is integrable. Show that when we identify functions 
that are equal almost everywhere, 

Pp{f,9) = J \f-gf 

defines a metric on Lp, and that (Lp,pp) is a complete metric space. 
Show also that the only continuous linear mapping from Lp (with 
pointwise operations) to R is the zero mapping. 

4.5 Function Spaces 

Among the most important examples of Banach spaces are certain subsets 
of the space B{X,Y) of bounded functions from a nonempty set X into a 
Banach space Y, where the norm on B{X, Y) is the sup norm 

ll/ll = sup{||/(a:)|| :xGX}. 
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Note that when X is a compact interval [o, b\, we usually write B[a, b] rather 
than B{[a,b\); we use similar notations without further comment in related 
situations. 

A special case of the next result has appeared already (Exercise 
(4.1. 6:4)). 

(4.5.1) Proposition. IfY is a Banach space, then B{X,Y) is a Banach 
space. 

Proof. Let (/„) be a Cauchy sequence in B{X,Y), and e > 0. There 
exists N such that \\fm ~ fn\\ < s for all m,n > N. For each x G X we 
have 

\\fm{x) - fn{x)\\ < Wfm ~ /nil < £ 

whenever m,n> N] so {fn{x))^^i is a Cauchy sequence in Y. Since Y is 
complete, 

f{x) = lim fn{x) 

n—^oo 

exists; also, for all m> N, 

||/m(a;) - /(x)|| = lim \\ fm{x) - fn{x)\\ < e . (1) 

n—^oo 

H6I1C6 

||/(x)|| < ||/^(X)|| + Wf^ix) - f{x)\\ < ll/^ll +£. 

Since x G X is arbitrary, we see that / G B{X, Y). Also, it follows from (1) 
that II fm — f II < e for all m> N. Since £ > 0 is arbitrary, ( f„) converges 
to / in B{X, Y). Hence B{X, E) is complete. □ 

(4.5.2) Exercises 

.1 Let E be a finite-dimensional Banach space, and {ei, . . . , Cn} a basis 
of Y. Prove that each / G B{X, Y) can be written uniquely in the 
form X i-G X)fc=i fk{x)ck with each fk G B{X, F). Prove also that for 
each A, / i-G /fc is a bounded linear mapping of B{X, Y) into B{X, F). 

.2 Prove that the mapping / i-G sup^g^ /(2^) of yB(A, R) into R is 
continuous. 

.3 Let y be a Banach space, and fn ^ series in B{X,Y). Let 

On be a convergent series of nonnegative real numbers such that 
ll/nll < c„ for each n. Show that /n converges in the Banach 

space B{X, Y). 

.4 Let / = [a,b] be a compact interval, and BV{I) the linear space of 
all real-valued functions of bounded variation on /, with pointwise 
operations. Show that 



ll/llbv = !/(«) I + ^/(fo^) 
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defines a norm on BV{I), and that BV{I) is complete with respect to 
this norm. (For the second part recall Exercise (1.5.15:6).) 

Let /, /i, / 2 , . . . be mappings of a nonempty set X into a metric space 
{y,p)- We say that the sequence (/„) 

• converges simply to / on AT if for each x € X the sequence (/n(a;)) 
converges to f{x) in Y; 

• converges uniformly to / on X if 

sup p{fn{x),f{x)) 0 as n -)> oo. 

x&X 



Clearly, uniform convergence implies simple convergence; but, as the next 
exercise shows, the converse is false. 

(4.5.3) Exercises 

.1 Give an example of a sequence (/„) of continuous mappings from 
[0, 1] into [0, 1] that converges to 0 simply, but not uniformly, on [0, 1]. 
(Consider a spike of height 1 travelling along the x-axis towards 0.) 

.2 Let F be a normed space. Prove that a sequence (/„) in B{X, Y) 
converges to a limit / in the normed space B{X,Y) if and only if 
(/n) converges uniformly to / on X. 

.3 Let X he a, compact metric space, and (/«), (gn) strictly increasing 
sequences of real-valued functions on X that converge simply to the 
same bounded function / : X — >■ R. Show that for each m there exists 
n such that fm < g-n (that is, fm{x) < gn{x) for all x € X). Show 
also that we cannot omit “compact” from the hypotheses. 

Now let (X, p) be a metric space, and Y a normed space. The set of all 
continuous mappings of X into Y is denoted by C{X,Y) or Cy{X), and 
the set of all bounded continuous mappings of X into Y by C°°{X,Y) or 
C^(X); so 

C°°{X,Y) = B{X,Y)nC{X,Y). 

If X is compact, then C°°{X, Y) = C{X, F), by Exercise (3.3.7: 1). In gen- 
eral, C°°{X, F) is a linear subspace of B{X, F); we consider it as a normed 
space, taken with the sup norm. 

We usually write C^{X) and C{X), respectively, instead of C°°(AT, R) 
and C{X, R). 

(4.5.4) Proposition. If X is a metric space and Y a Banach space, then 
C°°(X, F) is a closed, and therefore complete, subspace of B{X,Y). 
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Proof. Let (/„) be a sequence of elements of C(X,Y) converging to a 
limit / in B{X, Y). For each e > 0 there exists N such that ||/ — /n|| < ^/3 
whenever n > N. Given xg in X, construct a neighbourhood U of Xg such 
that if X G U, then \\f]\[{x) — /iv(a^o)|| < s/3. For each x GU vie then have 

||/(a;) - f{xg)\\ < \\f{x) - fN{x)\\ + ||/at(x) - /jv(a;o)|| 

+ ll/w(a:o) - /(a:o)|| 

< ll/-/iv|| + | + ||/Ar-/|| 

= e. 

Since e > 0 and xg G X are arbitrary, it follows that / is continuous on X. 
Thus C°°{X,Y) is closed in B{X,Y); whence, by Propositions (4.5.1) and 
(3.2.9), C°°{X,Y) is complete. □ 

Proposition (4.5.4) shows that a uniform limit of hounded continuous 
functions is continuous. Taken with Exercise (4.5.3: 1), this observation 
highlights the significance of the next theorem. 

(4.5.5) Dini’s Theorem. Let X he a compact metric space, and (fn) an 
increasing sequence in C{X) that converges simply to a continuous function 
f. Then (fn) converges to f uniformly. 

Proof. Let e > 0. For each x G X there exists such that if n > N^, 
then 0 < f{x) — fn{x) < e/3. Since / and /at^ are continuous, there exists an 
open neighbourhood Ux of x such that if x' G Ux, then \f{x) — f{x') \ < e/3 
and IfN^x) - fNAx')\ < s/3] whence 

0 < f{x') - fN^x') 

< \f{x) - f{x')\ + f{x) - fN^x) + IfN^x) - 

< 1 + 1+1 

= e. 

Since X is compact, there are finitely many points xi,. . . ,Xi/ of X such 
that the neighbourhoods Uxi cover X. Setting 

ns = max{7Va;. : I <i <n}, 

consider n > ns. Given x G X, choose i such that x G UxP, then 

0 < f{x) - fn{x) < f{x) - fnsix) < f{x) - fN,,.{x) < £. 

Since £ > 0 and x G X are arbitrary, we conclude that (/„) converges to / 
uniformly. □ 



(4.5.6) Exercises 
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.1 Show that “increasing” can be replaced by “decreasing” in Dini’s 
Theorem. 

.2 Give an alternative proof of Dini’s Theorem using the sequential 
compactness of X. 

.3 Let X be a metric space, Y a Banach space, and D a dense subset of 
X. Let (/„) be a sequence of bounded continuous mappings of X into 
Y such that the restrictions of the functions /„ to D form a uniformly 
convergent sequence. Prove that (/„) is uniformly convergent on X. 

.4 Let X be a metric space, and Y a normed space. Prove that the 
mapping (x, /) !->■ f{x) is continuous on X x C°° (X, Y) . 

.5 Let / be a compact interval in R, and (/„) a sequence of increasing 
real functions on / that converges simply in / to a continuous function 
/. Prove that / is increasing and that (/„) converges to / uniformly 
on /. 

.6 Let a, b be real numbers with 6 > 0, and let X be the set of all 
continuous mappings / : [0, 6] — >■ R such that /(O) = a. Prove that 
X is complete with respect to the sup norm. 

.7 Let / be a compact interval in R, xq € I, and a > 0. Show that 

||/||' = sup{e-“l"-"«l|/(x)|:x€/} 

defines a norm on C{I), and that C{I) is complete with respect to this 
norm. 

.8 In the notation of Exercise (4.5.2: 4) prove that if a sequence (/„) 
converges to a limit / with respect to the norm on BV{I), then 
it converges to / with respect to the sup norm on B{I). 

With I = [0, 1] find a sequence in BV{I) nC(/) that 

(i) converges to a limit / S C{I) with respect to the sup norm, and 

(ii) is not a Cauchy sequence with respect to • 

(Note Exercise (1.5.15:4).) 

Let X be a metric space, Y a normed space, and X a subset of B{X, Y). 
We say that T is 

• equicontinuous at a & X \i for each e > 0 there exists 5 > 0 such that 
||/(x) — /(a) II < e whenever f £ tF and p{x, a) < S, 

• equicontinuous {on X) if it is equicontinuous at each point of X; 
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• uniformly equicontinuous if for each e > 0 there exists 5 > 0 such that 
||/(x) — f{y)\\ < s whenever f G x G X, y G X, and p{x,y) < 6. 

Clearly, uniform equicontinuity implies equicontinuity, and if T is equicon- 
tinuous at a, then each f G T Ss, continuous at a. 

(4.5.7) Exercises 

In these exercises, X, Y, and T are as in the first sentence of the last 
paragraph. 

.1 Suppose that there exist constants c > 0 and A > 1 such that 

ll/(a;) - f{y)\\ < cp{x,y)^ 

for all f G X and all x^y G X. Show that T is uniformly 
equicontinuous. 

.2 Let a > 0, and let T be the set of all mappings / : [0, 1] — >■ R such 
that f exists, is continuous, and has sup norm at most a. Show that 
T is uniformly equicontinuous. 

.3 Show that {a;” : n G N} is not equicontinuous at 1. 

.4 Let (/„) be an equicontinuous sequence of real- valued functions on 
X. Prove that the sequence 

(fiVf2V---VfJZi 

is also equicontinuous. 

.5 For each A G L let .Fy C B(X,Y) be equicontinuous at a. Prove 
that if L is a finite set, then Uagz, equicontinuous at a. Give an 
example where L is an infinite set and is not equicontinuous 

at a. 

.6 Let (/„) be a sequence of functions In B{X,Y) that converges simply 
to a function / and is equicontinuous at a G X. Prove that / is 
continuous at a. Hence prove that the closure of an equicontinuous 
set in C°° (X,Y) is equicontinuous. 

.7 Prove that if X is compact and T C C{X, Y) is equicontinuous, then 
T is uniformly equicontinuous. 

.8 Suppose that X is compact, and let (/„) be a convergent sequence 
in C{X,Y). Prove that (/„) is uniformly equicontinuous. (Let / = 
lim„_>oo fn- Given e > 0, choose N such that ||/ — /„|| < e for all 
n > N. First find (5i > 0 such that \\fn{x) — fn{y)\\ < whenever 
p{x, y) < and n > N.) 
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.9 Suppose that X is compact, and that (/„) is an equicontinuous se- 
quence in C {X,Y) that converges simply to a function f : X ^ Y. 
Then / is continuous on X, by Exercise (4.5.7: 6). Show that (/„) 
converges uniformly to /. (Given £ > 0, use Exercise (4.5.7: 7) to ob- 
tain 6 as in the definition of “uniformly equicontinuous” . Then cover 
X by finitely many balls of the form B{x, 5).) 

.10 Let (/„) be a sequence of continuous real-valued mappings on a 
compact interval I. 

(i) Prove that if (/„) is a Cauchy sequence in C{I), then it is a 
Cauchy sequence in 7^2(7). 

(ii) Prove that if (/„) is both equicontinuous and a Cauchy sequence 
relative to the L 2 ~norm, then (/„) converges in C(/). 

(For (ii) fix to G 7 and £ > 0. Choose 5 > 0 such that if t G 7 
and \t — to\ < S, then |/„(t) — fn{to)\ < s for all n. Let x be the 
characteristic function of 7 fl [to — to + <^] > and show that 

y X(f) l/m(fo) - /n(fo)l^ dt < 

for some constant c > 0 and all sufficiently large m and n. Deduce 
that (/n(to)) is a Cauchy sequence in /.) 

Show that the equicontinuity hypothesis cannot be dropped in (ii). 



If X is compact and E is a Banach space, then we have a powerful 
characterisation of totally bounded subsets of C{X,Y). 



(4.5.8) Ascoli’s Theorem.^ Let X be a compact metric space, Y a 
normed space, and T a subset of C{X,Y) . Then T is totally bounded if and 
only if 



(i) T is equicontinuous and 

(ii) for each x € X, 

T. = {fix) :fGB} 
is a totally bounded subset ofY. 



Proof. Assume first that T is totally bounded, and let £ > 0. Construct a 
finite £-approximation {fi, . . . , f^} to if. Then for each / in IF there exists 
i such that ||/ — fi\\ < s. So for each x £ X we have ||/(a:) — /i(a:)|| < £, 
from which it follows that {/i(a;), . . . , /at ( a;)} is an £-approximation to T^. 



^This is also known as the Ascoli-Arzeld Theorem. 




4.5 Function Spaces 211 



Hence Tx is totally bounded. On the other hand, choose 5 > 0 such that if 
p{x,y) < 6, then \\fk{x) — /fc(y)|| < e for each k. With / and fi as in the 
foregoing, we have 

ll/(a;) - f{y)\\ < \\f{x) - fi{x)\\ + WMx) - f,{y)\\ 

+ \\fi{y)- f{y)\\ 

<£+£+£ 

= £. 

Hence T is equicontinuous. 

Now assume, conversely, that conditions (i) and (ii) hold, and let £ be 
any positive number. For each x £ X choose an open neighbourhood Ux 
of X such that ||/(y) — /(a:)|| < £ for each f & T and each y G Ux- Since 
X is compact, it can be covered by a finite family {Ux^, ■ ■ . ,Ux^} of such 
neighbourhoods. Now, the sets J^xi {I < i < m) are totally bounded, as 
is therefore their union K. Let {^i, . . . ,^n} be a finite £-approximation to 
K. On the other hand, let be the finite set of all mappings of {1, ... , m} 
into {!,..., n}, and for each G <j) let 

Si^ = {f G T : \\f{xi) - ^^(i)|| < £ (1 < i < m)} . 

Then for each f G T there exists (p G such that f G S(p. Since there 
are only finitely many of the sets Sip (some of which may be empty), to 
complete the proof that T is totally bounded it suffices to prove that the 
diameter of each Sp is at most 4£. To this end, consider any p G ^ and 
any two elements f,g of Sp. Given x G X, choose i such that x G [ 4 ,.. 
Then ||/(a;) - f{xi)\\ < e and \\g{x) - g(xi)\\ < e. But \\f{x,) - ^^(i)|| < e 
and || 5 (a:i) — ■C<p(i)|| ^ applications of the triangle inequality show, 

in turn, that \\f(xi) — g{xi)\\ < 2£ and ||/(x) — (/(x)|| < 4£. Since a; G X is 
arbitrary, it follows that ||/ — g|| < 4£; whence diam(S'i^) < 4£. □ 

(4.5.9) Exercises 

.1 Let X be compact, and let (/„) be a bounded equicontinuous se- 
quence of mappings of X into Y. Prove that there exists a subsequence 
{frik)T=i that converges for each x G X. (Let (x„) 

be a dense sequence in X. Setting /o,„ = /„, construct sequences 
{fi,n)'^=i (f = 0, 1, . . .) such that for all i and n, 

(i) (/i+i,„) is a subsequence of (/i,„) and 

(ii) {fi,n{xi))n=i converges in Y. 

Then show that (/„,„(x)) converges in Y for each x G X.) 

Use this result to give another proof of the “if” part of Ascoli’s 
Theorem. 
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.2 Let Co, Cl > 0, and let S consist of all differentiable functions / : 
[0,1] — >■ R such that ||/|] < cq and |1/'|| < ci. Prove that S' is a 
compact subset of C[0, 1]. 

.3 For each positive integer n and each a; > 0 let 

fn{x) = sin \/ X + 4n^7T^. 

Prove that 

(i) (/„) is equicontinuous on R®"*"; 

(ii) (/„) converges simply to 0 on R®'*'; 

(iii) (/„) is not totally bounded in C°°(R°^). 

(For the last part, show that if (/„) were totally bounded, then it 
would converge to 0 uniformly on R°^.) 

4.6 The Theorems of Weierstrass and Stone 

In this section we follow a path from the famous, and widely applicable, 
approximation theorem of Weierstrass to its remarkable generalisation by 
Stone. 

(4.6.1) The Weierstrass Approximation Theorem. If I is a com- 
pact interval in R, then the set of polynomial functions on I is dense in 
C(/). 

Thus for each f G C{I) and each e > 0 there exists a polynomial function 
p on I such that 



||/-p|| = sup{|/(a:) -p{x)\ :xGl}<s. 

In other words, each element of C(I) can be uniformly approximated, to 
any degree of accuracy, by polynomial functions. 

We derive the Weierstrass Approximation Theorem as a simple conse- 
quence of a more general theorem about linear operators on C{I). 

By a positive linear operator on C{X), where X is any metric space, we 
mean a linear mapping L : C{X) — >■ C{X) such that Lf > 0 whenever 
/> 0 . 

In the remainder of this section we let pk denote the monomial function 
X ^ x^ on R. 

(4.6.2) Korovkin’s Theorem. Let I he a compact interval in R, and 
(Ln) a sequence of positive linear operators on C{I) such that 

lim LnPk =Pk {k = 0, 1, 2). 
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Then Lnf — f / for all f G C(I). 



Proof. For each t in I let gt be the element of C{I) defined by 
gt{x) = {t- = fpo{x) - 2tpi{x) + P 2 {x). 

The linearity of implies that 

^n9t ~ ^ ^nPO 2tLjiPi + LjiP2 



whence 



0 < {Lngt) (t) 

= {{LnPo) (t) -l)-2t ((L„Pl) (t) -t)+ ((T„P 2 ) (f) - 

< \\LnPo - Toll + |2t| ||T„P 1 - Pill + ||L„P 2 -P 2 II • 

Since and |2t| are bounded on /, our hypotheses ensure that (L„gt) (t) -G 
0 uniformly on J as n — >■ 00 . We use this observation shortly. 

Given / G C(J) and e > 0, and noting the Uniform Continuity Theorem 
(Corollary (3.3.13)), choose <5 > 0 such that if x,y G I and |a; — p| < S, 
then \ f{x) — f{y)\ < s. Fix t in I, and consider any cc G /. If |t — x| > 6, 
then 

\f{t) - f{x)\ < 2 ll/ll < 2 ll/ll ^ ll/ll g,{x). 

It follows from this and our choice of S that 

< ^\\f\\ 9 t{x)+e 

for all x in I. Hence 

^ ll/ll 9t < f{t)po - f <epo+ ^ ll/ll 9t- 

Since Ln is linear and positive, we have 

2 2 
^^nPO ^ ll/ll LnPt ^ /(^) ^nPO -^n/ ^^nPO ll/ll ^nQt- 

Hence 

|/(t)(L„Po)(t) - (T„/)(t)| < e llTnPoll + ^ ll/ll {Ln9t){t). 

Thus 

|/(t)-(T„/)(t)| 

< \f{t) - fit) iLuPo){t)\ + \f{t) iLnPo){t) - {Lnf)it)\ 

< \fit)\ |1 - iLnPo)it)\ + e ||L„po|| + ^ ll/ll (i„5t)(t) 

< ll/ll Ibo - LnPoW +^(lbo|| + Ibo - LnPoW) + ^ ll/ll (in5i)W- 
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It now follows from our hypotheses, and the observation in the first 
paragraph of the proof, that ||/ — Lnf\\ — 1 0 as n — >■ oo . □ 



Proof of the Weierstrass Approximation Theorem. Without loss 
of generality, take / = [0, 1]. For each / G C(/) and each positive integer n 
define the corresponding Bernstein polynomial Bnf by 

k=0 ^ ' 



Then is a positive linear operator on C(T). Routine calculations (with 
reference to the binomial theorem) show that BnPo = Po, that BnPi = pi, 
and that 



{BnP2) (x) = - ^-”2 1 



X H — X ■ 

n 



X as n - 



It follows from Korovkin’s theorem that limy^^oo 11/ ~ ^nfW = 0 for each 



feC{i). □ 



(4.6.3) Exercises 

.1 Show that there is no loss of generality in our taking I = [0, 1] in the 
proof of the Weierstrass Approximation Theorem. 

.2 Prove that B„pq = po, that B^Pi = Pi, and that 



{BnP 2 ) (x) = 



n — 1 



-x^ + —X 

n 



X as n 



oo. 



.3 Let f{x) = x^. Calculate R„(/), and hence prove that Bn{f) — 1 / as 
n — >■ oo. 

.4 Prove that if p is a polynomial function of degree at most fc on [0, 1], 
then so is R„(p) for each n. (Use induction on k.) 

.5 Prove that there is only one positive linear operator L on C[0, 1] such 
that L{f) = f for all quadratic polynomial functions. (Use Korovkin’s 
Theorem.) 

.6 Suppose that / and /' belong to C{I), where I is a compact interval. 
Prove that for each e > 0 there exists a polynomial function p such 
that II/ — pII < e and ||/' — p'|| < £■ (Reduce to the case / = [0,1]. 
First find a polynomial q such that ||/' — g|| < £.) 

.7 Let / be a compact interval contained in (0, 1). For each / G C{I) 
and each n G N define Q„/ on / by 



(Qnf) (x) = 



k^O 



f{k/n) 



'(1-a;) 



i—k 
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where [t\ denotes the integer part of t. Prove that \\Bnf — Qnf\\ — f 0 
and hence that, on I, f is the uniform limit of a sequence of 
polynomials with integer coefficients. 

8 A function / : R — >■ C is said to be periodic if 

a = min {r > 0 : Vt G R (/(t + r) = f(t))} 

exists and is positive, in which case a is called the period of / and / 
is also said to be a-periodic. 

Prove Korovkin’s Theorem for 2n -periodic functions: let I = [— tt, tt], 
let 

V{I) = {fGC{I):fi-7T) = f{n)}, 

and let (Ln) be a sequence of positive linear operators on V{I) such 
that Lnf —1 / uniformly as n — >■ oo for / = 1, cos, and sin; then 
Lnf — ^ / uniformly for all / G P{I)- (Write z = coscc and apply 
Theorem (4.6.2).) 

9 Although this exercise mentions Fourier series, it does not require any 
knowledge of Fourier analysis. Let I = [—tt, tt]. For each / G V{I) the 
fcth partial sum of the Fourier series of f is 

k 

(Skf) (2;) = Y + X! ’ 

n— 1 

where for n > 1, 



a„ = - / 



f (t) cost dt, 
/ (t) sintdt. 



The nth Cesdro mean of the Fourier series of f is 



n— 1 



Gnf=~y' Skf. 

n 



k—Q 



Prove that 



{Gnf) {X) 



1 

2utt 



r fit +x) 

J — TT 



sin |nt 
sin if 



2 

dt. 



and hence that is a positive linear operator on P(/). Then prove 
that for each / G P{I), (Gnf) converges to / uniformly on /. (Use 
the preceding exercise.) 
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The following result, which was proved by Miintz in 1914, is an interesting 
generalisation of the Weierstrass Approximation Theorem. 

Let ® sequence in [l,oo) that diverges to oo. Then 

span {l,a;^Tx''^, . . .} is dense inC[0, 1] if and only if the series 
1/^n diverges to oo. 

An elementary proof of Miintz’s Theorem can be found on pages 193-198 
of [10]. Here is a very recent generalisation of Miintz’s Theorem, due to P. 
Borwein and T. Erdelyi [6]. 

Let be a sequence of distinct positive real numbers. 

Then span |l, . . .| is dense in C[0, 1] if and only if 

^ A„ / (A^ + l) diverges to oo. 

A different, more abstract, generalisation of Theorem (4.6.1) was given 
by Stone in 1937. In this generalisation we let X he a, compact metric 
space; we consider C{X) as an algebra, with the pointwise operations of 
addition, multiplication, and multiplication-by-scalars; and we are inter- 
ested in dense subalgebras of C{X). The property introduced in the next 
definition plays a key role in the proof of the Stone- Weierstrass theorem. 

We say that a set A of real-valued functions on a metric space X sepa- 
rates the points of X if for each pair x, y of distinct points of X there exists 
f & A such that f{x) ^ f{y). 

(4.6.4) The Stone— Weierstrass Theorem. Let X be a compact metric 
space, and A a subalgebra ofC{X) that contains the constant functions and 
separates the points of X. Then A is dense in the Banach space C{X). 

The next two lemmas lead us to the proof of this theorem. 

(4.6.5) Lemma. Under the hypotheses o/ Theorem (4.6.4), if ip, if €. A, 
then ip Alp and ipV ip belong to the closure of A in C{X). 

Proof. Given f G A and £ > 0, first apply the Weierstrass Approximation 
Theorem (4.6.1) to construct a polynomial function p such that 

\\t\-p{t)\<e (0<t< ll/ll). 

Then 

I |/(x)| -po/(x)| < £ (x€X). 

Since s is arbitrary, we see that |/| G A. The desired conclusion now follows 
by taking / = \ip — ip\ and noting the identities 

ipAip= \{ip + ip- \if-ip\), 
ip \/ ip = \ {ip ip \if — ip\) . □ 
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(4.6.6) Lemma. Under the hypotheses of Theorem (4.6.4), for each pair 
X, y of distinct points of X and each pair a, b of real numbers, there exists 
g € A such that g{x) = a and g{y) = b. 

Proof. Since A separates the points of X , there exists h G A such that 
h{x) ^ h{y). Define 



g{t) 



a+ {b 



h{t) — h{x) 
h{y)-h{x)- 



Since A contains the constant functions and is an algebra, g G A . Clearly, 
g{x) = a and g{y) = b. □ 



Proof of the Stone— Weierstrass Theorem. Given / G C(X) and 
£T > 0, we need only show that there exists h G A such that ||/ — h\\ < e. 
To this end, for each ^ in .4 define 

U{g) = {xG X : g{x) < f{x) + s} , 

L{g) = {xG X : g{x) > f{x) - e} , 

and note that, as g is continuous, U{g) and L{g) are open sets. It follows 
from Lemma (4.6.6) that for each t G X the sets U{g), with g G A and 
g(t) = f(t), form an open cover of X. Since X is compact, we can extract 
a finite subcover {U{gi ), . . . , [/((/„)} of X. Define 

ht = gi U g 2 /\ ■ ■ ■ /\ g-n- 

Then ht G A, by Lemma (4.6.5); ht(x) < f{x) + s for each x G X; and 
ht{t) = f{t), so t G L{ht). Thus {L{ht))f^x i® open cover of X, from 
which we can extract a finite subcover, say {L(htf), ■ ■ ■ , L(/it^)} . Then the 
function 

h = ht^y ht^y ■ ■ - y ht,^ 
belongs to A, by Lemma (4.6.5); also, 

f{x) - £ < h{x) < f{x) + e 
for each x G X, so ||/ — h\\ < s. □ 



It is simple to verify that the Weierstrass Approximation Theorem is 
the special case of the Stone- Weierstrass Theorem in which the algebra A 
consists of all polynomial functions on the compact interval I. Since the 
polynomial functions on / with rational coefficients form a countable dense 
set in this algebra A, we see that C{I) is a separable metric space; this is a 
special case of the following more general corollary of the Stone- Weierstrass 
Theorem. 
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(4.6.7) Corollary. If X is a compact metric space, then the Banach 
space C{X) is separable. 

Proof. Let (x„) be a dense sequence in X, and for each positive integer 
k write 

fn,k{t) = X\B{xn, k~^)). 

The set S of all functions of the form 

/ Cti £0L2 

ni,kiJ ri2,k2 Jui^ki’’ 

with each au a nonnegative integer, is countable. Hence the subspace A of 
C{X) generated by S is separable (see the paragraph immediately preceding 
Proposition (4.3.8)). So to complete the proof we need only show that A is 
dense in C{X). Since .4 is a subalgebra of C{X), if S separates the points 
of X we can invoke the Stone- Weierstrass Theorem. But for each pair x, y 
of distinct points of X we can choose n,k such that x £ B(xn,k~^) and 
y £ X\B(xn, k~^). We then have fn,k{x) yf 0 (as X\B{xn,k~^) is closed) 
and fn,k{y) = 0- ° 

(4.6.8) Exercises 

.1 Let / be a strictly increasing continuous function on / = [0, 1]. Prove 
that the subalgebra of C{I) generated by {1, /} is dense in C{I). 

.2 Let A" be a compact metric space containing at least two points, and 
let A be the subalgebra of C{X) generated by the family 

(fh^p(t,a;))^g^. 

Prove that A is dense in C{X). 

.3 Define a sequence (un) of polynomial functions on R inductively, as 
follows. 

ui{t) = 0, 

^n+l(t) — ^n{t] T 2 (f ’^ni.t') ) . 

Prove that Un maps [0, 1] into [0, 1], and that the sequence {un{t))l^^i 
converges uniformly to '/i on [0,1]. Hence prove that if .4 is a 
subalgebra of C[0, 1] and f £ A, then |/| G A. 

This proof can be used to eliminate the reference to the Weierstrass 
Approximation Theorem from the proof of the Stone-Weierstrass 
Theorem, thereby making the former a genuine corollary of the latter. 

.4 Let / be a compact interval in R, and / a continuous mapping of 
the rectangle I x I into R. Prove that for each £ > 0 there exists a 
polynomial 

n 

P{x,y) = ^ aj^kX^y^ 

j,k=0 
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such that 

sup \f{x,y) -p{x,y)\ < e. 

x,yGl 

.5 Prove the Complex Stone- Weierstrass Theorem: let X be a compact 
metric space, and A a subalgebra of C{X,C) that contains the con- 
stant functions, separates the points of X, and is closed under com- 
plex conjugation (so that f* G A whenever f G A, where f*{x) = 
f{x)*); then A is dense in C{X, C). 

Can we remove the hypothesis that A is closed under complex 
conjugation? 

.6 Use the Stone-Weierstrass Theorem to prove that each 27r-periodic 
continuous function / : R — >■ C is a uniform limit of a sequence of 
trigonometric polynomials of the form 

N 

t i-G {on sin nt + bn COS nt) , 

n=-N 

where the coefficients a„,6„ belong to C (cf. Exercise (4.6.3: 9). Let 
S be the set of 27r-periodic elements of C°°(R, C). First note that 

E(c‘) = m 

defines an isometric isomorphism of S with C(T, C), where 

T = {zGC:\z\ = 1} 

is the unit circle in the complex plane.) 

.7 Let / be a compact interval, and p > 1. Prove that the Banach space 
Lp{I) is separable. Prove also that Lp(R) is separable. (First use 
Exercise (2.3.10) to prove that C{I) is dense in Lp{I).) 

4.7 Fixed Points and Differential Eqnations 

In this final section of the chapter we show how various ideas that have 
appeared in the earlier sections are used to establish the existence of a so- 
lution ip of the first-order ordinary differential equation p'{x) = f{x, p(x)) 
on a compact interval. In order to do this, we first introduce a fundamental 
fixed-point theorem. 

Let X and Y be metric spaces, and / a mapping of X into Y. We say 
that / satisfies a Lipschitz condition, or is a Lipschitz mapping, if there 
exists a constant c > 0 such that p{f{x),f{y)) < cp{x,y) for all x,y in X; 
c is then called a Lipschitz constant for /, and / is said to be Lipschitz 
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of order c. In the special case where 0 < c < 1, / is called a contraction 
mapping of X into Y. 

A Lipschitz map is uniformly continuous (Exercise (3.2.11: 6)). 

A mapping of a metric space X into itself is called a self-map. By a fixed 
point of a self-map f : X ^ X we mean a point f G X such that f{f) = f. 

(4.7.1) Exercises 

.1 Let 

n 

p{x,y) = X! 

j,k=0 

be a polynomial function of two variables x,y. Prove that p satisfies 
a Lipschitz condition on any bounded subset of . 

.2 Let / be a mapping of a metric space X into itself, and define the 
iterates of / inductively: for each x G X, 

f X if n = 0 

r(^) = { 

[ ifnGN+. 

Prove that if, for some positive integer N, f^ has a unique fixed point 
then ^ is a fixed point of /, and / has no other fixed point. 



Fixed points play an important role in many applications of mathemat- 
ics, including the solution of differential equations and the existence of 
economic equilibria [51]. Many of these applications depend on our next 
result, Banach’s Contraction Mapping Theorem. 

(4.7.2) Theorem. A contraction mapping of a nonempty complete 
metric space into itself has a unique fixed point. 

Proof. Let X he a, nonempty complete metric space, / a contraction 
mapping of X into itself, and c G (0, 1) a Lipschitz constant for /. Choose Xq 
in X, and define a sequence {xn)'ifiLi inductively by setting x„ = /(x„_i). 
For each fc > 1 we have 

p{xk,Xk+l) = p{f{xk-i),f{xk)) 

< cp(xk-i,Xk) 

< ■■■ 

< c'"/9(xo,Xi). 
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So if m > n > 1, then 

m— 1 

p{Xn,Xra) < ^ p{xk,Xk+l) 

k—n 
m— 1 

k—n 

oo 

< p{xo,xi) ^ o'" 

k—n 

= p(xo,xi)- ^ 0 as n — >■ oo. 

1 — c 

Hence (a;„) is a Cauchy sequence in the complete space X. Let ^ be its 
limit in X; then 

^ = lim Xn = lim /(x„_i) = /(^). 

n—^oo n—^oo 

Thus ^ is a fixed point of /. 

Now suppose that r; is a fixed point of / distinct from Then 

P{^,V) = PifiOJiv)) < cp{^,v) < p{^,p), 
which is absurd. Hence ^ is the unique fixed point of f in X. □ 

Recall that a self-map / of a metric space X is said to be contractive 
if p{f{x),f{y)) < p{x,y) for all distinct x,y in X; and that, according to 
Edelstein’s Theorem (Exercise (3.3.7: 4)), a contractive self-map of a com- 
pact metric space has a unique fixed point. The next exercise shows that we 
can neither remove the compactness hypothesis from Edelstein’s Theorem 
nor replace the word “contraction” by “contractive” in the hypotheses of 
Banach’s Contraction Mapping Theorem. 

(4.7.3) Exercises 

.1 Let B be the unit ball in the Banach space cq, and for each positive 
integer n let e„ be the element of cq whose nth term is 1 and all of 
whose other terms are 0. Show that there is a unique linear mapping 
n : Co — >■ Co such that 

n(e„) = e„+i 

for each n. Then show that 

v{x) = i (1 -h ||x||) Cl -h u{x) 



defines a contractive map of B into itself such that v{x) yf x for each 
X G B. (For the last part note that rifc=i (l ~ 2“^) > 1 — 2“^.) 




222 



4. Analysis in Normed Linear Spaces 



.2 Let X, Y be Banach spaces over F, U the open ball in X with centre 
0 and radius a, and V the open ball in V with centre 0 and radius b. 
Let 0 < c < 1, and let x F — >• F be a continuous mapping such 
that for all x G U, 

(i) \W{x,yi) - ip{x,y 2 )\\ < c\\yi - 7 / 2 II for all G V, and 

(ii) ||</j(a;,0)|| < 6(1 - c) . 

Show that there exists a unique mapping f : U ^ V such that 
f{x) = (p{x, f{x)) for all X G U, and that / is continuous on U. (For 
each X G U define 

fo(x) = 0, 

fn+l{x) = (fi{x,fn{x)). 

Show that fn is a continuous mapping of U into V, that the series 
(/" ~ fn-i) converges absolutely in the Banach space B{U, F), 
and that its sum is the required function /.) 

.3 Let F be a Banach space, yo G Y, V = B{yQ, b) C F, and 0 < c < 1. 
Let u be a mapping of V into F such that 

(i) lk(l/i) -x{y 2 )\\ < c\\yi -1/2II for all yi,y 2 G V, and 

(ii) l|r’(j/o) - 2/0II < ^(1 - c). 

Prove that v has a unique fixed point in V. 

.4 Let X be a metric space such that each continuous self-map of a 
closed subset of S has a fixed point. Prove that X is complete. (Sup- 
pose the contrary, and choose a Cauchy sequence (a;„) in X that does 
not converge in X. Assuming, without loss of generality, that Xi yf xj 
whenever i j, for each x G X let 

= inf {p{x, Xn) ■■ x^ x„} ■ 

Show that > 0. Next, let 0 < r < 1, set cr(0) = 0, and define a{n) 
inductively such that <j{n) > a{n — 1) and 

p{xi,Xj) {i,j>a{n)). 

Let S = {xa(n) : n > l} and f{xa{n)) = Xa{n+i)-) 

.5 Let a, b be real numbers with 0 < 6 < 1, and let X be the set of all 
continuous mappings / : [0, 6] — >• R such that /(O) = a (so, according 
to Exercise (4.5.6: 6), AT is a Banach space relative to the sup norm). 
Define a mapping T on A1 by 

(Tf){t) = a-\- f \f{x)\dx {0<t<b). 

Jo 
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Prove that T is a contraction mapping of X into itself, and hence 
that there exists a unique f G X that is differentiable and satisfies 
/' = I /I on the interval (0, 6). 

A function ip is said to be continuously differentiable on an interval I of 
R if (f' exists and is continuous on I. 

We now use Theorem (4.7.2) to prove the first of two theorems about the 
existence of solutions of ordinary differential equations, thereby generalising 
the work of Exercise (4.7.3: 5). 

( 4 . 7 . 4 ) Picard’s Theorem. Let K be the rectangle 

{{x,y) G : |a; - xq\ < a, \y - yo\ < b} 

where a, 6 > 0. Let f : K ^ H be a continuous mapping such that there 
exists c > 0 with 

\f{x,yi) - f{x,y2)\ <c\yi- 2/2I 

for all applicable x,yi,y2 {in other words, f satisfies a Lipschitz condition 
in its second variable). Let 



M= sup \f{x,y)\ 

(x,y)eK 

and 

( min {a, if M > 0 

h=l 

[a ifM = 0. 

Then there exists a unique continuously differentiable mapping ip on the 
interval L = [xq — h, xq + h], such that 

t{xo) = yo 



and 



ff{x) = f{x, p{x)) for all x G I. 



Proof. In view of the version of the Fundamental Theorem of Calculus 
in Exercise (1.5.14: 1), it suffices to find a continuous mapping : 7 — ^ R 
satisfying 

T{x)=yo+[ f{t,T{t))dt ( 1 ) 

for all x G I. Let V denote the closed ball with centre y >-G yo and radius b 
in the Banach space (C{L), H-H) , where H-H denotes the sup norm, li y gV, 
then for alH G 7 we have \y{t) — yo\ < b and therefore (t,y{t)) G 77; so 



f{t,y{t))dt 



Fy{x) =yo + 




224 



4. Analysis in Normed Linear Spaces 



defines a mapping Fy : I ^ TL. We see from Exercise (1.5.12:4) that Fy 
satisfies the Lipschitz condition 

I I 



\Fy{x) - Fy{x') \ = 






< Ml 



\Jx' I 

and is therefore uniformly continuous on /. Moreover, 



\Fy{x) — yo\ < M \x — xo\ < Mh < b 



for all X € I, so y 1-^ Fy maps V into V. 

We now endow C{I) not with its usual norm, but with the norm defined 

by 

||/ir=sup{e-2=l"-"°l|/(a:)|:a:G/}. 

Recall from Exercise (4.5.6: 7) that C{I), and hence V, is complete with 
respect to the metric p' associated with this norm. We prove that y ^ Fy 
is a contraction mapping on {y,p') . To this end, consider yi,y 2 G V and 
X G F Taking, for example, the case where x > xq, we have 



since 



\Fy,{x) - Fy^{x)\ < f \f{t,yi{t)) - f{t,y2{t))\ dt 
Jxq 

<cj \yi{t) - y2{t)\ dt 

Xq 

J Xq 

< le2=k-ol||yi-y2ir, 

J g 2 c(t-xo) dt= ^ ^ 



It follows that 



\Fyi -Pv2\\' < 7;\\yi-y2\\' {yi,y2 gC(/)). 



Applying Banach’s Contraction Mapping Theorem (4.7.2), we now obtain 
a unique element Lp oiV satisfying equation ( 1 ) . □ 



A restricted version of Picard’s Theorem can be proved by applying the 
Contraction Mapping Theorem to a certain complete subset of C(/), taken 
with the usual sup norm; this produces a positive number S, which may be 
smaller than h, and a solution of the differential equation on the interval 
[xo — 5, Xq + <5]. (See Chapter X of [13].) With a bit more work, it can 
then be shown that that solution extends to I (Exercise (4.7.5: 4)). The 
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introduction of the norm ||-|f — a device due to Bielicki [4] — both simplifies 
the proof and provides, at a stroke, the solution over the whole interval 
/. (Note that when / is only known to be defined on Ff, I is the largest 
interval on which it makes sense to talk about a solution of the differential 
equation y' = f{x,y).) 

By examining closely the proofs of Theorems (4.7.2) and (4.7.4), we 
obtain the following iteration scheme for a sequence (?/„) of functions 
converging to a solution of the differential equation in the preceding 
theorem. 



yo{x) = yo, 

yn(x) = yo+ f{t,yn-i{t))dt (n > 1). 

J Xq 

This scheme can be used in practice, although there are better methods of 
finding solutions of first-order differential equations of special types. 



(4.7.5) Exercises 



.1 Apply the foregoing iteration scheme to solve the differential equation 
y' = j/ on R with initial condition y (0) = 3. 



.2 Let 

K = {{x,y) G : |a;| < a, \y\ < 6} , 

where a, b are positive constants. Let / be a continuous mapping of 
K into R such that f{x, y) < 0 if xy > 0, and f{x, y) > 0 if xy < 0. 
Prove that x i— 1- 0 is the unique solution of the differential equation 
y' = f{x, y) defined in a neighbourhood of 0 and such that y(0) = 0. 
(Assume the contrary, and consider, in a compact interval containing 
0, the points where a solution attains its maximum or minimum.) 

.3 Define / : R^ — >■ R by 



f{x,y) = { 



Define a sequence of functions 



— 2x 


if y > x^ 


_2]/ 

X 


if |y| < x^ 


2x 


if y < — x^ 



by setting yo(x) = and 



y„+i(x) = [ f{t,y„{t))dt. 
Jo 



Show that for each x yf 0 the sequence (yn(a^))^o convergent. 

Comment on this, in the light of Exercise (4.7.5: 2) and the paragraph 
immediately preceding this set of exercises. 
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.4 Let xo,yo, I, and K be as in Theorem (4.7.4), and let / : AT — ^ R be a 
continuous function with the following property: for each (^, rj) G K° 
there exist (5 > 0 and a unique continuously differentiable function 
y ~ R- such that y{^) = y and y'{x) = f{x,y{x)) 

whenever |a; — ^| < 6. Show that there exists a unique continuously 
differentiable function : / — >■ R such that (f{xo) = yo and (p'{x) = 
f{x,(p{x)) for all X € I. (Let S be the set of all positive numbers 
S < h with the property that there exists a continuously differentiable 
function y : [xq — <5, a;o + 5] — >■ R such that y{xo) = yo and y'{x) = 
f{x,y{x)) whenever |a; — a;o| <5. Let cr = sups', suppose that a < h, 
and derive a contradiction.) 

.5 Let I be the closed interval [a, b] in R, and 

A= {(x,y) € R^ ■ a ^ X < y < bj . 

Let the function fc : / x J — ^ R be continuous on A and vanish 
everywhere on (I x I)\A, and for each f G C{I) define T f : I ^ TL 
by 

Tf{t)= [ k{s,t)f{s)ds (tGl). 

J a 

Show that for all sufficiently large n, T" is a contraction mapping of 
C{I) into itself, and hence that the integral equation 

f{t)=g{t)+[ k{s,t)f{s)ds, 

J a 

has a unique solution / in C{I) for each given g G C{I). (For the 
contraction mapping part, show that 

\T-f{x)-T-g{x)\<^{x-ar\\f-g\\ 

n\ 

for all x G / and f,g G C{I).) 

.6 Taking / = [0, 1], use the preceding exercise to find the solution of 
the integral equation 



f{t)=g{t) + c [ {t-sff{s)ds (tGl), 

Jo 

where c is a positive constant and g G C{I). 

.7 Let c > 0, let / be a continuous real-valued mapping that satisfies 
the condition 



\f{x,yi) - f{x,y 2 )\ <c\yi- 2 / 2 I 
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on the strip [a,b] x R in R^, and let (xo,yo) be any point of that 
strip. Prove that the differential equation y' = f{x,y) has a unique 
solution y : [a, &] — >■ R such that y{xo) = yo- (For each x £ [a,b] 
define yo{x) = yo and 



yn+i{x) = yo+ f{t,yn{t))dt. 

J Xq 



Let 



-^ = I 2 / 0 I + niax{|j/i(a;)| : a < x < b} . 



Show that the series 



00 



2/0 + X! “ yn-l{x)) 

n—1 



converges uniformly on [a, 6] to a sum y(x), by comparison with the 
series 



m + £ 



„_i ,^(b — a) 



n—1 



(n — 1)! 

Then show that y is the desired unique solution.) 
Let K be the compact set 



{{x,y) € R X R" : |x - xo| < a, \\y - yo|| < b} , 

where a,b > 0. Let / = (/i, . . . , /„) be a continuous mapping of K 
into R” such that there exists c > 0 with 



lf(x,yi) - /(a;, 2 / 2)1 < c\\yi - 7 / 2 II 



for all applicable x,yi,y 2 - Let 

M= sup \\f{x,y)\\ 

(x,y)eK 

and 

( min {a, if M > 0 

h=l 

[a ifM = 0. 

Prove that there exists a unique mapping tp = (tpi, . . . , ipn) of the 
interval I = [xq — h,xo + h] into R", such that 

(i) (f{xo) = yo, and 

(ii) for each k the component mapping pk is continuously differen- 
tiable and satisfies p'j.{x) = fk{x,p{x)) on I. 
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.9 Let p, q, and r be continuous real-valued functions on the interval 
[a, b] , let xo G [a, b] , and let yo, j/g be real numbers. Use the preceding 
exercise to prove that there exists a unique function y : [a, &] — R 
satisfying the differential equation 

y" +p{x)y' + q{x) = r{x) 

on [a, b], with initial conditions y{xo) = yo and y'{xo) = y^. 

Although Picard’s Theorem enables us to solve, both in principle and in 
practice, a large class of differential equations, there are simple examples of 
differential equations to which it does not apply and yet for which solutions 
can easily be found. One such example is the equation y' = y^^^ with initial 
condition y(0) = 0: this equation has two solutions — namely, y = 0 and 
y = (2cc/3)^^^ — but the function (x,y) i-G y^^^ does not satisfy a Lipschitz 
condition at (0,0). The final theorem of this chapter covers cases such as 
this, and provides us with a good application of Ascoli’s Theorem and the 
Stone- Weierstrass Theorem. 

(4.7.6) Peano’s Theorem. Let K be the rectangle 

{{x,y) G : I® - xo\ < a, \y - yo\ < 6} , 
where a, 5 > 0. Let f : K ^ R be a continuous mapping, 

M= sup \f(x,y)\ 

{x,y)^K 

and 

( min{a, if M > 0 

h=l 

[a ifM = 0. 

Then there exists a continuously differentiable mapping p on the interval 
L = [xq — h, xg + /i] , such that 

= Vo 



and 



(p'{x) = f{x, p{x)) for all x € I. 



Proof. Using Exercise (4.6.8: 4), construct a sequence (p„) of polynomial 
functions of two variables such that ||/ — Pn\\ < for each n, where H-H 
denotes the sup norm on C{K). We may assume that |p„| < 2M for each n. 
By Exercise (4.7.1: 1), Picard’s Theorem, and the Fundamental Theorem 
of Calculus, the integral equation 

y{x) = yo+ / Pn{t,y{t))dt 

J Xq 
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has a unique solution (/?„ on the interval I. Exercise (1.5.12:4) shows that 
for all xi,X 2 G I, 



\‘Pn(x2) - (Pn{xi)\ < 



Pn(t,(p„{t))dt 



< 2M \x2 — a:i| 



It follows that {<Pn) is an equicontinuous sequence in C{I). Also, for each 
X G I, 



\‘Pn{x)\ < |?/o| + 



Pn{t,ipn{t))dt 



< |yo|+ 2 M|/| ; 



so {(pn) is a bounded sequence in C{I). Applying Ascoli’s Theorem (4.5.8), 
and, if necessary, passing to a subsequence of {pn), we may now assume 
that {pn) converges uniformly on I to an element p of C{I). Since / is 
uniformly continuous on K, for each e > 0 there exists t > 0 such that if 
(xi,yi) G K and 

max{|a:i - X 2 I , \yi - 2 / 2 I} < t, 



then 

\f{xi,yi) - f{x 2 ,y 2 )\ < £■ 



Choose N such that for all n > N, 



\\p - pn\\ < min{t,£} 

and 2“" < e. Consider any x G I and any n> N. Note that for each t G I, 
{t, p{t)) belongs to the closed set K and 

l/(C7>W) - < e. 



We now have 



! Xq 

and therefore 



f{t,p{t))dt- [ f{t,pn{t))dt 
J Xn 



< e\x — xo\ < |/| e 



p{x)-yo- f fit, pit)) dt 

J Xr\ 



< \pix) - Pnix)\ 

+ Pnix)-yo- Pnit,Pnit))dt 

Xq 
>'X 

iPnit, pnit)) - fit, Pnit))) dt 
Co 

''X 

if it, Pnit)) - fit, pit))) dt 

Co 

< £ + 0+ |/| 11/ — Pn\\ + |/|£ 

= (1 + 2|J|)£. 
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Since e > 0 is arbitrary, we conclude that 

ip{x)=yo+f f{t,ip{t))dt (x€l). 

J Xq 

A final application of the Fundamental Theorem of Calculus (see Exer- 
cise (1.5.14: 1)) shows that Lp is continuously differentiable and satisfies the 
desired conditions. □ 

There are two fundamental differences between Picard’s Theorem and 
Peano’s: 

• in the former the solution is unique, whereas in the latter it need not 
be; 

• the proof of Picard’s Theorem embodies an algorithm for com- 
puting the solution, but Peano’s Theorem uses the highly noncon- 
structive property of sequential compactness and is an intrinsically 
nonalgorithmic theorem. 

By an e -approximate solution to the differential equation 

y' = f{x,y), y{xo) = yo (2) 

in an interval J containing xg we mean a mapping y : J — >■ R with the 
following properties. 

— There exists a partition (xi,X 2 , . . . ,x„) of J such that y is continu- 
ously differentiable on each of the intervals [xi,Xi+i ] ; 

— \y'{x) - f{x,y{x))\ < e for all x G U”=i^ {xi,Xi+i) ; 

— y{xo) = yo- 



(4.7.7) Exercises 

.1 Under the hypotheses of Theorem (4.7.6), but without invoking that 
theorem, show that for each e > 0 there exists an e-approximate 
solution of (2). (Choose i5 > 0 such that |/(a;i,yi) — f{x 2 ,y 2 )\ < £ 
whenever (xi,yi) G K and ||(a:i,yi) — (x 2 , 2/2)11 < S. Take points xg < 
xi < ■ ■ ■ < Xn = xg + h such that Xi+i — Xi < min {(5, S/M} , and 
construct an £-approximate solution of (2) on [a;o,xo -I- h] that is 
linear on each of the intervals [xi,Xi+i ] ; then deal with the interval 
[xo — h^xg]. This technique is known as the Cauchy-Euler method.) 

.2 Under the hypotheses of Theorem (4.7.6), let (£„) be a sequence of 
positive numbers converging to 0, and for each n let ipn be an e„- 
approximate solution of the differential equation (2) on I. Suppose 
that (v?n) converges uniformly to a continuous function on /. Prove 
that 
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(i) {t, G K for each t £ I ] 

(“) Ixo dt f{t, ip{t)) dt uniformly on / as n oo ; 

(iii) (/? is a solution of the differential equation (2) on I. 

3 Use the preceding two exercises to give an alternative proof of Peano’s 
Theorem. 

4 Let I, K, f, M, and c be as in the hypotheses of Picard’s Theorem. Let 
£i, £2 > 0, and let ipi be an £j-approximate solution to the differential 
equation on I. Show that 



pCp-Xol _ 1 

Wl{x) - ^ 2 {x)\ < Wlixo) - + (£i +£ 2 ) 

c 

for each x £ I. (Use Exercise (2.3.3: 14).) Hence find an alternative 
proof of Picard’s Theorem. 




5 

Hilbert Spaces 



When shall we three meet again...? 

Macbeth, Act 1, Scene 1 

This chapter explores the elementary theory of Hilbert spaces. In Section 1 we 
introduce the notion of an inner product, with its associated norm, on a linear 
space, and prove some fundamental inequalities. The next section deals with 
orthogonality, projections, and orthonormal bases in a Hilbert space, and with 
their use in approximation theory. In Section 3 we derive Riesz’s characterisation 
of the bounded linear functionals on a Hilbert space, and show how this can 
be applied both in the theory of operators and to prove the existence of weak 
solutions of the Dirichlet Problem. 



5.1 Inner Prodncts 

So far we have shown how to abstract the notions of distance and length 
from Euclidean space to the abstract contexts of a metric space and a 
normed space, respectively. In this chapter we show how to abstract the no- 
tion of the inner product in R" to the context of a linear space. The result- 
ing combination of distance, length, and inner product provides the space 
with an extremely rich structure that turns out to have many significant 
applications in pure and applied mathematics. In particular — although we 
are not able to explore that subject in this book — certain linear self-maps 
of such a space are the mathematical analogues of quantum-mechanical 
operations. 
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By an inner product on a linear space X over F we mean a mapping 
{x, y) !->■ {x, y) of X X X into F such that the following hold for all x, y, z 
in X and all A, y in F. 

IPl (x,y) = {y,x)*. 

IP2 {Xx + yy,z) = \{x,z) + y{y,z). 

IPS {x, x) > 0, and {x, x) = 0 if and only if x = 0. 

The element (x, y) of F is then called the inner product of the vectors x 
and y. Note that by IP2, the inner product is linear in the first variable; 
and that by IP2 and IPl, it is conjugate linear in the second — that is, 

(x, Xy + yz) = X*{x, y) + y*{x, z). 

We define an inner product space, or a prehilbert space, to be a pair 
{X, (•, •)) consisting of a linear space X over F and an inner product (•, •) 
on X. When there is no confusion over the inner product, we refer to X 
itself as an inner product space. By a subspace of an inner product space 
X we mean a linear subset S of X , taken with the inner product induced 
on S by that on X ; thus the inner product on S is the restriction to S' x S' 
of the inner product on X. 

The simplest example of an inner product space is the Euclidean space 
F", with the inner product of vectors x = (xi, . . . , x„) and y = (j/i, . . . , y„) 
defined by 

n 

(x,y) = ^xkyl- 

k=l 

For another example consider the linear space ? 2 (C) of square-summable 
sequences in C, introduced in Exercise (4.4.4: 3), where the inner product 
of two elements x = (xfe) and y = (yk) is defined as 

OO 

(x,y) = ^xkyl- 

k=l 

This can be regarded as a generalisation of the first example, since the one- 
one mapping (xi, . . . , x„) i— >■ (xi, . . . , x„, 0, 0, . . .) of C" into ? 2 (C) preserves 
the value of the inner product. 

Before discussing a third example, in Exercise (5.1.1: 2), let us agree to 
call a complex-valued function / on a subset X of R integrable if its real 
and imaginary parts are integrable over X, in which case we define 

f f= f Re(/) + i / Im(/). 

Jx Jx Jx 

The complex integration spaces Lp(X, C) are then defined in the obvious 
way, and we use Lp{X,F) to denote either Lp{I) or Lp{X, C), depending 
on whether F = R or F = C. 
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(5.1.1) Exercises 

.1 Prove that the equation {x,y) = does define an inner 

product on hiC). (You must first prove that the series ^nVn is 

convergent when (cc„) and (y„) are elements of hiC).) 

.2 By a weight function on a compact interval / = [a, b] we mean a 
nonnegative continuous function w on I such that if / € C{I) and 
fj w{t)f{t) dt = 0, then / = 0. Prove that 

{f,g)= [ 

J a 

defines an inner product on L 2 {I,F) (where, as always, we iden- 
tify two elements of L 2 (I, F) that are equal almost everywhere). We 
denote the corresponding inner product space by L 2 ,«i(d,F). 

(5.1.2) Proposition. Let X be an inner product space. Then 

Ikll = 

defines a norm on X. Moreover, the inner product and this norm satisfy 
the Cauchy-Schwarz inequality 

l(^,y)l<INI IMI 

and Minkowski’s inequality 

{x + y,x + y)^^^ < {x, xf^"^ + {y, y^^^ . 

Proof. We first prove the two inequalities. For any x,y G X and any 
A G F we have, by IPl through IPS, 

0 < {x + Xy, X + Xy) = {x, x) + {x, Xy) + (Ay, x) + (Ay, Ay) , 

so 

||xf + A*(x,y) + A(x,y)* + AA*||yf >0, (1) 

with equality if and only if a; -I- Ay = 0. If ||y|| yf 0, the Cauchy-Schwarz 
inequality is obtained by taking X = — (x,y) / |ly||^; if |lx|| yf 0, the equality 
is obtained by taking X = — (x,y) / ||x||^ ; if ||a:|| = ||y|| = 0, then IPS shows 
that X = 0 = y and hence that (x,y) = 0, so the Cauchy-Schwarz inequality 
holds trivially. 

Taking A = 1 in (1) and using the Cauchy-Schwarz inequality, we obtain 

{x + y, x + y) = {x, a;) -k 2 Re {x, y) + (y, y) 

< (x,a;)-k2|(x,y)|-k(y,y) 

< (x,a;)-k2||a;|| ||y|| -k(y,y) 

= ((a;,x)^/V (y,y)^^^) , 
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which immediately yields Minkowski’s inequality. It is now a simple ex- 
ercise, involving this inequality and the defining properties of an inner 
product, to show that x (x,x)^^^ is a norm on X. □ 

When we refer to the norm or the metric structure on an inner product 
space X, we always have in mind the norm, and the corresponding metric 
structure, associated with the inner product as in Proposition (5.1.2). 

(5.1.3) Exercises 

.1 Complete the details of the proof that if (•, •) is an inner product on 
a linear space X, then ||a:|| = {x,x)^^‘^ defines a norm on X. 

.2 Prove that an inner product on a linear space X is continuous, and 
that it is uniformly continuous on bounded sets, with respect to the 
corresponding product norm on X x X. 

.3 Prove the parallelogram law for vectors x, y in an inner product space: 

lk + 2/f + ||x-yf = 2||xf + 2||yf . 

Interpreting a norm as a length, we see that this law generalises the 
plane geometry theorem that the sum of the squares of the diagonals 
of a parallelogram equals the sum of the squares of its sides. 

.4 Use the parallelogram law to show that a Hilbert space is uniformly 
convex (see Exercise (4.2.2: 15)). 

.5 Let X be a normed space whose norm satisfies the parallelogram law 
(see the exercise before last) . Show that if F = R, then 

{x,y) = 3 {\\x + yf - Ik-j/f) 

1 /*2 

defines an inner product on X such that ||a;|| = (x,x) ' for each 
X G X. Then show that if F = C, there is a unique inner product on 
X related to the norm in this way. 

.6 Prove that there is no inner product on C[0, 1] such that (/, = 

ll/ll (the supremum norm). (Show that the supremum norm does not 
obey the parallelogram law.) 

.7 Prove that the inner product space L 2 ,j«(d, F), introduced in Exer- 
cise (5.1.1: 2), is complete. Prove also that C(/,F) is not a complete 
subspace of this inner product space. 
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An inner product space that is complete with respect to its norm is 
called a Hilbert space. For example, the Euclidean space F” is a Hilbert 
space, as is hiC). On the other hand, if tu is a nonnegative weight function 
on a compact interval I, then Exercise (5.1.3: 7) shows that L 2 ,j«(d, F) is 
complete, but C(/, F) is not complete, with respect to the inner product 

{f,g) = dt. 

(5.1.4) Exercises 

.1 Let X be an inner product space, Xq a closed linear subspace of X, 
and If the canonical mapping of X onto the quotient space X/Xq. 
Prove that 

{(p{x),ip{y)) = (x,y) 

unambiguously defines an inner product on A/Aq, and that the 
corresponding norm is the quotient norm on X/Xq. 

.2 Show that each inner product space A can be embedded as a dense 
subset of a Hilbert space H. (Extend the inner product by continuity 
to the completion of A, as defined on page 179.) H is then known as 
the {Hilbert space) completion of A. 

.3 Prove that any two completions H and H' of an inner product space 
A are isomorphic, in the sense that there exists a one-one linear 
mapping u of H onto H' such that (u{x),u{y)) = (x,y) for all x,y € 
H. 

.4 Let / be a compact interval. Show that L 2 {I, F) is the completion of 
the Hilbert space C(/, F) with respect to the inner product (f,g) = 
fi db 

5.2 Orthogonality and Projections 

Two elements x, y of an inner product space A are said to be orthogonal if 
(x, y) = 0, in which case we write x _L t/. In view of IPl, the relation _L is 
symmetric: x _L y if and only if y _L x. A vector x is said to be orthogonal 
to the subset S' of A if x _L s for each s G S; we then write x _L S. The 
set of all vectors orthogonal to S is called the orthogonal complement of S, 
and is written (pronounced “S perp” ) . It follows from IP2 that S'*- is a 

(linear) subspace of A; and from IPS that S fl S'*" is nonempty if and only 
if 0 G S, in which case S fl S'*- = {0}. Moreover, S'*- is orthogonal to S, 
in the sense that every element of S'*- is orthogonal to S: for, by Exercise 
(5.1.3: 2), if (s„) is a sequence of elements of S converging to s G S, then 
for each x G S"*-, 

(x, s) = lim (x, s„) = 0. 

n—^oo 
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For each x G X , {x}'*" is the kernel of the continuous linear functional 
z !->■ (z,x) on X, and so, by Proposition (4.2.3), is a closed subspace of X. 
Hence for each subset S of X, 



ses 



is closed in X. 

If X and y are orthogonal vectors, then, expanding {x + y, x + y), we 
obtain Pythagoras’s Theorem: 

\\x + yf = \\xf + \\yf. 

(5.2.1) Proposition. Let S he a nonempty complete convex subset of an 
inner product space X, and let a G X . Then there exists a unique vector s 
in S such that ||a — s|| = p{a, S). 

Proof. Let d = p{a,S), and choose a sequence (s„) in S such that 
p{a,Sn) — >■ d. Using the parallelogram law (Exercise (5.1.3: 3)) and the 
convexity of S, for all m and n we compute 

||Sm - Snll^ = IlSm - - (s„ - a)|l^ 

= 2 ||sm - a\\^ + 2 Ils„ - a||^ - ||sm - a + (s„ - a)|l^ 

2 2 1 1 1 1 1 2 
= 2 (z|| 2 Ct\\ 4 II 2 “1“ '^n) ^\\ 

< 2 ||sm - a\f + 2 ||5n - a||^ - 4(f 

= 2 - a\f - + 2 (^||s„ - a\f - d^^ 

— 0 as TO, n — >■ oo. 

Hence (s„) is a Cauchy sequence. Since S is complete, (s„) converges to a 
limit s in S'; then 



||a-s||= lim ||a - s„|| = p(a, S). 

n—^oo 

On the other hand, if s' G S and ||a — s'|| = p(a, S), then a computation 
similar to the one used at the start of the proof shows that 

||s — s'll^ = 2 ||s — a||^ + 2 ||s' — a||^ — 4 || j(s + s') — a||^ 

= 4 - II i(s + s') - a||^^ 

< 0 , 



so that s = s'. □ 
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It is worth digressing here to prove a converse of the foregoing result. 

(5.2.2) Proposition. Let S he a nonempty closed subset of the Euclidean 
space such that each point o/R^ has a unique closest point in S. Then 
S is convex. 

Proof. Supposing that S is not convex, we can find a,b £ S and A € (0, 1) 
such that 

z = Xa + {1- X)bi S. 

Since AT\S' is open, there exists r > 0 such that B{z,r) fl S' = 0. Let E 
be the set of all closed balls B such that B{z,r) C B and S Ci B° = 0; 
then B{z, r) £ T . The radii of the balls belonging to T are bounded above, 
since any ball containing B and having sufficiently large radius will meet 
S. Let Too be the supremum of the radii of the members of E , and let 
{B{xn,rn))'l^=i be a sequence of elements of E such that r„ — >• Too- Then 
Xn G B{z, Too) for each n. Since B{z, Too) is compact (Theorem (4.3.6)) and 
therefore sequentially compact (Theorem (3.3.9)), we may assume without 
loss of generality that (x„) converges to a limit Xoo- Let K = B(xoo,roo)', 
we prove that K £ E. 

First we consider any x £ B(z,r) and any e > 0. Choosing m such that 
\\xm — a^ooll < £) and noting that B{z,r) C B{xm,rm), we have 

\\x 3^oo|| ^ 11^ ^m|| ll^m ^oo|| 

<rjn+£ 

< Too + e. 

Since e is arbitrary, we conclude that ||a; — Xoo || < roo', whence B{z,r) C K. 
On the other hand, supposing that there exists s G S' 0 B{xaoEao), choose 
(5 > 0 such that ||s — cCoo|| < Voo—d, and then n such that 0 < roo — Vn < 5/2 
and \\xn — a:oo|| < 5/2. We have 

||s-a;„|| < ||s 

^ oo 
^ oo 

< Tn, 

SO s £ Sr\B{xn, Vn). This is absurd, as B{xn, r„) G E; hence Sr\B{xoo,roo) 
is empty, and therefore K £ E. 

Now, the centre x^o of K has a unique closest point p in S. This point 
cannot belong to K° , as K £ E; nor can it lie outside K, as Too is the supre- 
mum of the radii of the balls in E . Therefore p must lie on the boundary 
of K. The unique closest point property of S ensures that the boundary of 
K intersects S in the single point p. It now follows from Exercise (4.3.7: 7) 
that there exists a ball K' that is concentric with K, has radius greater 



^C3o|| ||^C30 ^n\\ 




S 

2 
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than Too, and is disjoint from S. This ball must contain B{z,r) and so be- 
longs to T. Since this contradicts our choice of Too, we conclude that S is, 
in fact, convex. □ 

(5.2.3) Exercises 

.1 Give an example of a norm ||-|f on R^, a closed convex subset S of 
, and a point x G R^ such that x has infinitely many closest points 
in S relative to ||-|f . 

.2 Let S be the subset of cq consisting of all elements (xn) such that 
= 0. Show that S' is a closed subspace of cq and that 
no point of cq\S has a closest point in S. (Given a = (a„) € cg\S, 
set a = 2“"a„ and show that p(a, S) < |a| . Let x = (x„) G S, 

suppose that ||a — x\\ < |o;| , and obtain a contradiction.) 

.3 Let S be a closed convex set in a uniformly convex Banach space 
X. (See Exercise (4.2.2: 15).) Show that to each point a of X there 
corresponds a unique closest point in S. (To prove existence, reduce 
to the case where a = 0 and p(0, S) = 1. Then choose a sequence (s„) 
in S such that ||s„|| — >■ 1, and show that is a Gauchy 

sequence in X.) 

.4 Give two proofs that cq is not uniformly convex. 

.5 Let S' be a bounded closed subset of R^ with the property that to 
each X G R^ there corresponds a unique farthest point of S — that is, 
a point So of S such that 

||x - Soil = suplllx - s|| : s G S} . 

Show that S consists of a single point. (First show that S is bounded. 
Then choose r > 0 such that S C B{0,r/2), and consider the family 
T of all closed balls B such that S C B C B{0,r). Show that T 
contains a ball with minimum radius, and then show that that radius 
is 0.) 

Proposition (5.2.2) was first proved by Motzkin in 1935, and the result 
in Exercise (5.2.3: 5) by Motzkin, Straus, and Valentine in 1953. 

We now turn from our digression to the subject of projections. In the 
special case of Proposition (5.2.1) where S' is a complete subspace of X, the 
unique point of S closest to a given vector a; G V is called the projection 
of the vector x on S, and the mapping that carries each vector in X to its 
projection on S is called the projection of X on S. For example, 

• the projection of X on V is the identity operator I : X ^ X defined 
by Ix = x; 
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• projections on finite-dimensional subspaces of X are always defined, 
since finite-dimensional normed spaces are complete, by Proposition 

(4.3.3); 

• the projection of a Hilbert space on any closed subspace is defined, 
in view of Proposition (3.2.9). 

The next result enables us to show that projections are bounded linear 
mappings. 

(5.2.4) Proposition. Let S be a complete subspace of an inner product 
space X, and P the projection of X on S. Then for each x € X, Px is the 
unique vector s € S such that x — s is orthogonal to S. 

Proof. Given x in X , let d = p{x, S). For all y in S and A in F, we have 
Px — Xy £ S, so that 

{x — Px + Xy, X — Px + Xy) > d'^ = {x — Px, x — Px ) , 
and therefore 

lAl^llyf + 2Re(A* {x-Px, j/)) > 0. 

Suppose that Re {x — Px, y) yf 0; then by the Cauchy-Schwarz inequality, 
2 / yf 0. Taking 

{x- Px, y) 

/\ • 

\\y\? 

we obtain the contradiction 

|A|"|| 2 /f + 2Re(A* {x-Px, j/)) < 0. 

Thus Re {x — Px, y) = 0. Likewise, Im {x — Px, y) = 0, so {x — Px, y) = 0. 

If, conversely, s is any vector in S such that a; — s is orthogonal to S , 
then s — Px is in S, and so 

(s — Px, s — Px) = {x — Px, s — Px) — (x — s, s — Px) = 0; 
whence s = Px, by IP3. □ 

(5.2.5) Exercises 

.1 Prove that if S' is a complete subspace of an inner product space X , 
then (S-L)-L = S. 

.2 Let P be the projection of a Hilbert space PI onto a complete subspace 
S. Use Proposition (5.2.4) to show that P is a linear mapping, and 
that 

{Px,Py) = {Px,y) = (x,Py) 

for all x,y £ H. Show also that ||Pa;|| < ||a;|| for all x £ H, and that 
if S y^ {0} , then ||P|| = 1. 
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.3 In the notation of the preceding exercise prove that each vector x € H 
has a unique representation in the form x = y + z with y G S and 
z S, and that I — P is the projection of i? on S'-*-. 

.4 To each vector a in an inner product space X there corresponds a 
linear functional Ua defined on X by 

Ua (x) = {x, a) . 

Prove that Ua is bounded and has norm ||a|l; and that if a yf 0, then 
z = ||a|| ^ a is in ker (ua) , Ua {z) = 1, and a = ||z|| ^ 2 :. 

.5 Let / be a nonzero linear functional on the Euclidean space R”. 
Prove that there exists a nonzero vector p orthogonal to the hyper- 
plane ker(/), such that f{x) = (x,p) for each x G R”. (Choose 
a G R"\ker(/) such that f(a) = 1. Let b be the foot of the per- 
pendicular from a to ker(/), and let p = A(a — b) for an appropriate 
value of A. Note that each x G R” can be written uniquely in the 
form X = f{x)a + y with y € ker(/).) 



A family of elements of an inner product space X is said to 

be orthogonal if {ei,ej) = 0 whenever i,j are distinct indices in I. If, in 
addition, ||ei|| = 1 for each i, then (e^) is called an orthonormal family, in 
that case we call {x, ef) the corresponding tth coordinate of the element x 
of A. 

For example, in the space L 2 ([— 7r,7r], C) , taken with the inner product 
the functions e„ (n = 0, ±1, ±2, . . .) form an orthonormal sequence, where 



e„(t) = ei"‘. 



The corresponding nth coordinate of / G L 2 ([— tt, tt], C) is 



1 

27T 



r7T 



/(t)e-^"‘dt. 



which is better known as the nth Fourier coefficient of /. 
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(5.2.6) Exercise 

Verify the mathematical claims made in the last paragraph. 

If (ei)i^j is an orthonormal family in an inner product space X , then for 
any finite index set J G I the vectors ej {j G J) are linearly independent: 
for if = 0, where each Xj G F, then for each f G J, 

0 = / ^ ^ ) g \ ^ ^ Xj {cj, Cj) = Xi- 

\jeJ / jeJ 

Thus the vectors Cj (j G J) form a basis for a finite-dimensional subspace 
of V. 

(5.2.7) Lemma. Let (e„)^^j^ be a finite orthonormal family in an inner 
product space X. Then for each x G X, 

|2 



N 



n—1 
N 



N 



n—1 



n—1 

N 

= < ikf , 



n—1 



and X — (^> 6") e„ is orthogonal to each e^. 

Proof. For each n write A„ = {x, e„) . Then 



N 



X! (2^-en) ' 



n—1 



N 



N 



— ( ^ ^ ^ ^ ^n^n / 

\m=l n—1 / 

N N 

— ^ ^ ^n) — ^ ^ \^r, 



So 



0 < 



N 



X ^ ^ 
n—1 

2 / ^ 

1 (^1 ^ ^ ^ ^ ^ ^n^ni ■ 



N 



Y^Xn 



N 



N 



N 



= a: 






n—1 



n—1 



n—1 
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N N 

n—l n—1 

N 

= ikf -Ei^"i'- 

n—l 

The first two of the desired conclusions now follow. On the other hand, 

/ AT \ N 

lx ^ ^ y — (^, t-fc) ^ ^ {^m — (^j ^k') ^k — 0- ^ 

\ n=l / n=l 



(5.2.8) Proposition. If (ej)jg/ is an orthonormal family in an inner 
product space X, then for each x G X, 

Ix — \i ^ I '• {xj ef) ^ 0 } 

is either empty or countable. 

Proof. Lemma (5.2.7) shows that for each finite subset J of / we have 

^\{x,ei)\^ < ||a;f . 

i€J 



Hence the set 

Ix.n = |z G / : |(a;,ei)|^ > n“^ ||a;||^| 

has at most n—l elements. Since Ix = U^i ^x,m we conclude that if Ix is 
nonempty, then it is countable. □ 

When (ci)ig/ is an orthonormal family in an inner product space X, 
Proposition (5.2.8) enables us to make sense of certain summations, such 
as X)ie/ > over possibly uncountable index sets. If Ix is empty, we 

define ~ nonempty, it is either finite or countably 

infinite; taking, for example, the latter case (the former is even easier to 
handle), we define 



^\{x,Ci)\^ = ^\{x,fn)\^ , (1) 

ie/ n=l 

where /i, / 2 , ... is a one-one enumeration of Ix- Note that the series on the 
right-hand side converges, since its terms are nonnegative and (by Lemma 
(5.2.7)) its partial sums are bounded by ||a;|| ; it follows from Exercise 
(1.2.17: 1) that the value of the expression on the left-hand side of (1) 
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is independent of our choice of the one-one enumeration fi, f 2, ■■ ■ of Ix- 
Moreover, we have Bessel’s inequality 

^\{x,ei)\^ < ||a:f . 
i^I 

In turn, when X is a Hilbert space, we can give meaning to another 
important type of series. Writing 

k 

^ ^ (3^? /n) fn 
n —1 



and using Lemma (5.2.7), we see that if A; > j, then 






{xjn)fn 

n=j+l 



= LI 

n=j + l 



Since l(^> /n)l^ converges, (s„) is a Cauchy sequence in X; so, as X is 

complete, (^> fn) fn converges to a sum s € X. Likewise, if f[, / 2 , ■ • ■ 

is another one-one enumeration of Ix, then fn) fn converges to a 

sum s' G X. We show that s = s'. Given e > 0, we choose X such that if 
k > N, then 



where 



|s-Sfc||<£, ||s' - 411 < and L l(a;, /n)l^ < 

n—k-\-l 



4 = L fn) fn- 



Taking 

m = max {/c : 4 = /„ for some n < N} . 
we see that m> N and 



OO 

llSm-SArf< L l4./n)l^<4. 



Hcncc 

||, _ /|| < ||, _ ,^|| + ||,^ _ + 114 _ /|| < 3s. 

Since e is arbitrary, it follows that s = s'. Hence the value of 

OO 

L (x, Ci) Ci = L fn) fn 
iel n=l 

is independent of the choice of the one-one enumeration fi, f 2, .. . of lx- 
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(5.2.9) Exercise 

Let {ei)i^i be an orthonormal family in a Hilbert space iL, and x^y 
elements of H such that Ix is countably infinite. Show that the value 
of the expression 

OO 

^ (x, Ci) (Ci, y) = Y^ {x, /„) (/„, y) 

iel n=l 

is independent of the one-one enumeration fi, f 2, ■■ ■ oi Ix- Show also 
that if /{, /2, . . . is a (possibly finite) one-one enumeration of ly, then 

00 

^ (x, a) (Ci, y) = ^ (x, /;) (/;, y) . 

iei n=l 

(5.2.10) Proposition. Let {ei)i^i he an orthonormal family in a Hilbert 
space H, let S be the closure in H of the subspace of H generated by (e^), 
and let P he the projection of H on S. Then for all x,y in H, 

Px = ^ (x,Ci) e„ 
i^I 

\\x-Pxf = ||xf -^|(x,ei)|^ 
iGl 

(Px, Py) = ^ (x, €i) {e„ y) . 
iei 

Proof. Consider, for example, the case where Ix is countably infinite. Let 
/i) /2) ... be a one-one enumeration of Ix- Lemma ( 5 . 2 . 7 ) shows that 

N N 

X-'^{x,fn)fn = ||xf - ^ |(X,/„)|^ 

n—1 71—1 

N N 

'^{xjn)fn =^|(X,/„)|% 

ri—1 n—1 

and X - Y,n=i /") /" i® orthogonal to /i, . . . , /at. Letting iV -)> 00, we 
see that 

2 

x-^(x,ei)ci = ||xf - ^ |(x,ei)|^ , 

iGl 

2 

^(x,Ci)ei =^|(x,ei)|^, 
iel i&I 
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and z = X — orthogonal to each /„. For each i G I either 

Cj = fn for some n, and therefore 2 _L e^, or else i ^ Ix', the latter case, 
using the continuity of the inner product, we have 



{Z, 6i) = {X, Ci) - ( ^ {X, fn) fn, 

\n=l / 

00 

= 0 - X! 

n—1 

00 

= - (fn,ei) 

n—1 

= 0, 



as 6i is orthogonal to each fn- It now follows that 2 is orthogonal to each 
vector in S', and hence, by Proposition (5.2.4), that Px = 

Using Exercise (5.2.5: 2), the continuity of the inner product, and 
Exercise (5.2.9), we now obtain 

{Px, Py) = {Px, y) 

00 

X! fn,y 

n=l 
o 

n—1 

= ^{x,&i) {ei,y) ■ □ 

iei 




By an orthonormal basis of a Hilbert space PI we mean an orthonormal 
family that generates a dense linear subspace of H . The following is a more 
or less immediate consequence of Proposition (5.2.10). 



(5.2.11) Proposition. The following are equivalent conditions on an 
orthonormal family (ei)ig/ in a Hilbert space H. 

(i) (ci) is an orthonormal basis of H. 

(ii) X = Xie/ each x G H. 

(iii) Xie/ = ||a;||^ for each xG H. 

(iv) {x, y) = Xig/ {x, Ci) {a, y) for all x,y G H. 
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The identity in condition (iv) of this proposition is known as Parseval’s 
identity. 

(5.2.12) Exercises 

.1 Prove Proposition (5.2.11). 

.2 Use Zorn’s Lemma (Appendix B) to prove that every nonzero Hilbert 
space has an orthonormal basis. 

.3 Let (ei)ig/ be an orthonormal basis in a separable Hilbert space H. 
By considering the balls B{ei, l/-\/2), or otherwise, show that / is a 
countable set. 

.4 Let H be an infinite-dimensional inner product space, and (e„)^j^ an 
infinite orthonormal sequence of vectors in H. By considering (e„), 
and without invoking either Theorem (4.3.6) or Exercise (4.3.7: 4), 
prove that the unit ball of H is not sequentially compact. 

.5 Let be an orthonormal basis of a separable Hilbert space H, 

and (a„)^i an element of hiC). Show that there exists a unique 
element a of H such that (a, e„) = a„ for each n. (Show that the 
partial sums of the series ^nOn form a Cauchy sequence.) 

.6 Prove that the functions 

t !->■ e„(t) = (n G Z) 

V 27T 

form an orthonormal basis of L 2 ([— tJ", tt], C) . (Noting Exercise 
(5.2.6), show that the linear space S generated by {e„ : n G Z} is 
dense in L2 ([— tt, tt], C) . To do this, first consider / G C([— tt, tt], C). 
Construct a continuous function g on R, with period 27 t, such 
that II/— ( 7 II 2 is arbitrarily small. Then use Exercise (4.6.8: 6) to 
approximate g, and therefore /, by an element of S.) 

It follows from this exercise and Proposition (5.2.11) that for each 
/ G L 2 ([— 7>') ’’■]) C) the corresponding Fourier expansion 

00 

/(n)e'"^ 

n— — oo 

converges to / in the L 2 norm, where 




In this case Parseval’s identity takes the form 

.. 00 

\f{x)\^dx=— Y |/W 
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.7 Take /(x) = x in the preceding exercise and apply Parseval’s identity, 
to show that 

.8 Show that = tt^/ 90. (Consider /(x) = | (x^ — tt^) .) 



Although Zorn’s Lemma guarantees the existence of an orthonormal ba- 
sis in any Hilbert space (Exercise (5.2.12:2)), it does not enable us to 
construct orthonormal bases. The following Gram-Schmidt orthonormali- 
sation process enables us to construct orthonormal bases when the Hilbert 
space H is separable. 

Using Proposition (4.3.8), first construct a (possibly finite) total sequence 
(oi, tt 2 , . . .) of linearly independent vectors in H. For each n let be the 
n-dimensional subspace of H spanned by {oi, . . . , a„}; since this subspace 
is complete (by Proposition (4.3.3)), the projection of H onto it is 
defined. Suppose we have found orthogonal vectors 6 i,..., 6 „ generating 
Hn- li H = Hn, stop the construction. Otherwise, a„+i ^ ; so by 

Proposition (5.2.4), 

hn+l — On-t-1 Pn^n+l 
is orthogonal to iL„, and therefore 

{b„+i,bk) = 0 (l<k<n). 

Elementary linear algebra shows that { 6 i, . . . , 6 „+i} is a basis of Hn+i- 
This completes the inductive construction of a (possibly finite) orthogonal 
sequence (5i, 62 , . . .) in iJ such that for each n, {bi, . . . ,&„} is a basis of 
Hn- Setting e„ = || 6 n||~^ bn and noting that Un dense in H, we see 
that (e„) is an orthonormal basis of H . 

The Gram-Schmidt orthonormalisation process has a very important 
application in approximation theory, which we now describe. 

Let w be a nonnegative continuous weight function on a compact interval 
I = [a, b] . Define the inner product 



{f^g)w= [ 

J a 

on L 2 {I), and the corresponding weighted least squares norm by 



l2,u) 




1/2 



{t)f{tfdt 



Given an element / of L 2 {I) and a natural number N, we have the 
approximation problem: 
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Find the polynomial function p of degree at most N that 
minimises the value of 

\\f-p\\2,w= [ 

J a 

This polynomial is called the least squares approximation to f of degree at 
most N. 

Now, the set Vn of polynomials of degree < is a finite-dimensional 
subspace of C(/); so the projection P/v of C{I) on Pn exists, and the unique 
least squares approximation to / of degree at most N is given hy pn = PNf- 
To compute the coefficients of pjq, we can use elementary multivariate 
calculus to calculate the values of Aq, . . . , Aat that minimise 




see [29]. However, this procedure is computationally inefficient if we are 
looking for least squares approximations to several functions in C (/) . In that 
case a better procedure is to apply the Gram-Schmidt process to the total 
sequence consisting of the monomials to compute orthonormal 

polynomials qg^qi, . . . , where qn{t) has degree n and {go, • • ■ , q-n} is a basis 
for Pn, then 

N 

PNf = {f,qn)n, q-a, 

n— 0 

by Proposition (5.2.10). One advantage of this method is that, having found 
the least squares approximation to / of degree at most n, in order to 
find the least squares approximation of degree at most n + 1 we simply add 
to Pn the single term (/, g„+i)^ g„+i. 

(5.2.13) Exercises 

.1 In the notation of the preceding paragraphs, take / = [—1,1] and 
w{t) = 1, and compute qo,qi, and q 2 - Hence find the quadratic least 
squares approximation to e^ in C[— 1, 1]. 

.2 Let w be a nonnegative continuous weight function on / = [a, &], let 
/ G C{I), and for each n let denote the least squares approximation 
to / of degree at most n. Prove that 

lim ||/-P„|| 2 ,^ = 0- 

n— >-oo ’ 

.3 In the notation of the last exercise, let (g„) be a sequence of polyno- 
mial functions that is orthogonal relative to (•,•)„, such that g„ has 
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degree n. Prove that each polynomial p of degree n can be written 
uniquely as a linear combination of qo, . . . , qn, and find the coefficient 
of qk in this linear combination. 

.4 Continuing Exercise (5.2.13:3), prove that qn{t) has n distinct real 
zeroes, and that those zeroes lie in the open interval (a, b). (Let 

p{t) = tm), 



where ti, . . . ,tm are the zeroes of (?„(t) in (a, b) at which g„(t) changes 
sign. Assume that m < n, show that w{t)p{t)qn{t) dt yf 0, and 
deduce a contradiction.) 



.5 Continuing Exercise (5.2.13:4), write 



Qn{t) — Ant" + Bnt" ^ + 

An+1 



A. 



fdn — Cty] 



n+1 

n+1 



B, 

A. 



and, for n > 1, 



^n+1 ^n—1 {QnjQn) 

w 

Al ((7„_i,(7„_i)^' 

Prove the triple recursion formula: 

qn+i{t) = {a„t + Pn)qn{t) -7„g„_i(t). 

(Consider p{t) = g„+i(t) - antqn{t)-) 

.6 Let I = [a, b], let w € C{I), and let p be a polynomial function. Prove 
the equivalence of the following conditions. 

(i) w(t)p{t)q{t) dt = 0 for all polynomial functions q of degree at 
most n. 

(ii) There exists an (n+ l)-times differentiable function u on I such 
that 

w{x)p{x) = {x G I) 

and 

u^^^(a^) = m^^^( 6“) = 0 (fc = 0, 1, . . . , n). 
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.7 Let I = [—1, 1], let a,P G (— l,oo) , and let 

w{x) = (1 — x)^ (1 + x)^ (x G I). 

For each n € N define the Jacobi polynomial of degree n by 
Rodrigues’s formula: 

= (1 - (1 + ((1 - (1 + 

(where, of course, d”/da:” denotes the nth derivative). Use the pre- 
ceding exercise to prove that {4>n)^=Q is an orthogonal sequence in 
F2.»(/,C). 

.8 In the special case a = /? = 0 of the last exercise, the Jacobi poly- 
nomial 4>n is known as a Legendre polynomial and is usually denoted 
by Pn- Prove that the Legendre polynomials satisfy the recurrence 
relation 

Pn+i{x) = {4:n + 2)xPn{x) - 4n^P„_i(x) 

on [—1, 1]. Use this and Exercise (5.2.13: 1) to find Psix) and P^x). 
(To establish the recurrence relation, write each term in the form 

in— 1 

where q{x) is a quadratic polynomial.) 



5.3 The Dual of a Hilbert Space 

We saw in Exercise (5.2.5: 4) that for each vector a in an inner product 
space X the mapping a; i— (x, a) is a bounded linear functional on X. We 
now show that the dual of a Hilbert space consists precisely of bounded 
linear functionals of this form (cf. Exercise (5.2.5: 5)). 

(5.3.1) The Riesz Representation Theorem. If u is a bounded linear 
functional on a Hilbert space H , then there exists a unique vector a G H 
such that u{x) = (x,a) for each x G H. In that case ||m|| = ||a|| . 

Proof. We first dispose of the uniqueness^ of a: indeed, if 
(x, a) = u{x) = (x, a') (x G H) , 



^This uniqueness argument applies to a linear functional of the form x (x, a) 
on an inner product space. 
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then, taking x = a — a' , we obtain 

||a — a'll^ = {a — a', a — a') = 0, 



so a = o'. 

To establish the existence of o, we may assume that tt yf 0. As t6 is linear 
and continuous, ker (u) is a closed subspace of H (Proposition (4.2.3)). Let 
P be the projection of H on ker (u), and choose y £ H such that u{y) yf 0. 
Setting 

2 = u{y)-^ {y- Py ) , 

we see that z G ker (u)"'", by Proposition (5.2.4), and that 
u{z) = u{y)~^ {u{y) - u{Py)) = 1. 

So for each x in iJ we have 



X — u{x)z € ker(u) 



and therefore 



0 = (x — u(x)z, z) = (x, z) — u{x) {z, z) = (x, z) — u(x) ||z|| 



Thus u(x) = (x, o) , where a = ||z|| z. The Cauchy-Schwarz inequality 
shows that |m(x)| < ||o|| ||x|| . Since also u ^||o||~^ o^ = ||o|| , we see that 



u = o 



□ 



(5.3.2) Exercises 

.1 Find an alternative proof of the existence part of the Riesz Represen- 
tation Theorem (5.3.1) for a separable Hilbert space H. (Let 
be an orthonormal basis of H, and u a bounded linear functional 
on H. Show that w(e„)*e„ converges to the desired element 

a G H.) 

.2 Use the Riesz Representation Theorem to give another solution to 
Exercise (5.2.12:5). (In the notation of that exercise, show that 
u{x) = 6n) defines a bounded linear functional on H.) 

.3 By the second dual of a normed space X we mean the dual space 
X** = (A*)* of X*. We say that X is reflexive if for each u G X** 
there exists x„ G A such that u{f) = f{xu) for each / G A*. Prove 
that any Hilbert space is reflexive. 



By an operator on a normed space A we mean a bounded linear mapping 
from A into itself; the set of operators on A is written L{X). (Strictly 
speaking, we have here defined a bounded operator; since we do not consider 
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unbounded operators in this book, it is convenient for us to use the term 
“operator” to mean “bounded operator”.) 

It is common practice to denote the composition of operators by juxta- 
position; thus if S, T are operators on X, then To S' is usually written TS; 
moreover, we write T^ for TT, T^ for T(TT), and so on. 

For a first application of the Riesz Representation Theorem, let T be 
an operator on a Hilbert space H , and for each a G X consider the linear 
functional a; i— >■ {Tx, a) on H . The inequality 

|(Tx,a)|<||rcr||||a||<||T|| \\a\\\\x\\ 

shows that this functional is bounded and has norm at most ||T|| ||a||. By 
the Riesz Representation Theorem, there exists a unique vector T*a such 
that 

{Tx, a) = {x, T*a) {x G H ) ; 

moreover, 

r*a||<||T||||a||. (1) 

The mapping T* : H ^ H so defined is called the adjoint of T, and is an 
operator on H . To justify this last claim, consider a,b in H and A, /i in F. 
Since 



{Tx, Aa + fib) = A* {Tx, a) + fi* {Tx, b) 

= A* {x,T*a)+fT {x,T*b) 

= {x, XT* a + fiT*b) 

for all X G H, we see that 

T* (Aa -b fib) = XT* a + fiT*b. 

So T* is linear. Inequality (1) shows that T* is bounded and has norm at 
most ||T||. Since 

{T*x,y) = {y,T*x)* = {Ty,x)* = {x,Ty) , 

the uniqueness of the adjoint of T* shows that (T*)* = T. So ||T|| = 
||(T*)*|| < ||T*|| and therefore ||T*|| = ||T|| . 

An operator T on H is said to be 

• selfadjoint, or Hermitian, if T* = T; 

• normal if T*T = TT* . 

Selfadjoint and normal operators have particularly amenable properties and 
are among the most important objects in Hilbert space theory. (See [24], 
[44], and other books that deal with such topics as spectral theory.) 

(5.3.3) Exercises 

In all the exercises of this set except the first, H is a complex Hilbert space, 
S and T are operators on H, and ran(T) denotes the range of T. 
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1 Let (ei, 62 , . . . , e„) be an orthonormal basis of the Euclidean Hilbert 
space F”, and T an operator on F". Show that 

n 

Tx = ^ {x,6j) {Tej,ek) ek, 

j,k=l 

and hence that T can be associated with the n-by-n matrix whose 
(j, /c)th entry is {Tej, Ck) ■ With what matrix is T* associated in this 
way? 

2 By a hounded conjugate-bilinear functional on H we mean a mapping 
u : H X H ^ C that is linear in the first variable, conjugate linear 
in the second, and bounded, in the sense that there exists c > 0 such 
that \u{x,y)\ < c||x|l ||?/|| for all x,y G H. The least such c is the 
number written 

||m|| = sup{|-«(x,y)| : x,y G H, ||x|| < 1, ||?/|| < 1} . 

Show that the mapping u : H x H ^ C defined by 

u{x,y) = (Tx,y) (2) 

is a bounded conjugate-linear functional on H such that ||m|| = ||T|| . 
Show also that each bounded conjugate-linear functional u on H is 
related to a unique corresponding operator T as in equation (2). (For 
the second part, show that for each x G H the mapping y i— >■ u{x, y)* 
is a bounded linear functional on H.) 

3 Verify the polarisation identity. 

4:{Tx,y) = (T{x + y), x + y) - {T{x-y), x-y) 

+i {T{x + iy), x + iy) -i {T{x - iy), x-iy) . 

Show that if (Sx,x) = (Tx,x) for all x £ H, then S = T. 

Give an example of a nonzero operator T on the real Hilbert space 
such that (Tx,x) = 0 for all x € R^. 

4 Let A, fj, be complex numbers. Show that (AS' + yT)* = X*S* + y*T* 
and (ST)* = T*S*. 

5 Prove that T*T and TT* are selfadjoint. 

6 Prove each of the following statements. 

(i) ker(T*) = ran(T)-*- 

(ii) ran(T*) = ker(T)-*-. 

(hi) ker(T) = ker(T*T). 
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(iv) ran(TT*) is dense in ran(T). 

.7 Show that 

(i) T is selfadjoint if and only if {Tx, x) G R for all x G H. 

(ii) T is normal if and only if ||Tx|| = ||T'*x|| for each x G H. 

(For part (i), consider {Tx,x) — {T*x,x) , and note Exercise (5.3.3: 3).) 

.8 Prove that T is a projection if and only if T*T = T, in which case 
T is idempotent — that is, = T. (For “if”, show first that T is 
selfadjoint, and then that {x — Tx) _L Ty for all x,y G H.) 



We close this chapter by sketching how the techniques of Hilbert space 
theory can be applied to prove the existence of a type of solution for one 
of the fundamental problems of potential theory. (For more information on 
this topic, see, for example, pages 117-122 of [23].) 

For the rest of this chapter only, we follow the usual notational conven- 
tions of applied mathematicians. Thus we denote three-dimensional vectors 
by boldface letters, the element of volume in by dV, the element of sur- 
face area by dS', the unit outward normal to a surface by n, and the inner 
product of two vectors u, v in R^ by u • v. We assume familiarity with 
calculus in R^, including the elementary vector analysis of the gradient 
operator V and the divergence operator div, defined, respectively, by 



V/ 



(di di an 

\ i9x ’ dy’ dz ) 



for a real-valued function /, and 



div u = 



dux duy duz \ 
(9x ’ dy ’ dz ) 



for a vector u = (ux,Uy,Uz)- We also assume the fundamentals of the 
theory of L 2 {^) when is a Lebesgue measurable subset of R^. 

Let be a bounded open set in R^ for which Gauss’s Divergence 
Theorem holds: 



divudP = 



u • n dS”, 



IdQ 



where dD, is the boundary surface of and u : — >■ R^ is continuously 

differentiable on fl. It follows that Green’s Theorem holds in the form 




f f dv du\ 
Jan 



"^v — vV^u) dV 



dS', 
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where u, v are twice continuously differentiable mappings of into R, d/dn 
denotes differentiation along the outward normal to d^l, and is the 
Laplacian operator, 

= 1 1 . 

dx'^ dy'^ dz"^ 

We assume the following result, embodying Poincare’s inequality. 



There exists a constant c > 0 such that if v : ^ H is 

differentiable on and vanishes on the boundary offl, then 





||Vu|r dy 



1/2 



For a proof of this inequality under reasonable conditions on we refer to 
[37], Chapter 5, Theorem 1. 

Given a bounded continuous function / : > R, we consider the 

corresponding Dirichlet Problem: 

Find a function u : — >■ R that is twice differentiable on fl, 

satisfies = / on fl, and vanishes on the boundary offl. 

Suppose we have found a solution u of this Dirichlet Problem. Let w : — >■ 

R be twice differentiable and have compact support in — that is, w = 0 

outside some compact subset of fl. Then it follows from Green’s Theorem 
that 

[ uV^udP= [ vfdV, (3) 

Jq Jq 

since both u and v vanish on dfl. 

Now, it may not be possible to solve the Dirichlet Problem on but, as 
we now show, we can find a function m on that behaves appropriately on 
did and that satisfies (3) for all u : — ?► R that are twice differentiable and 

have compact support in il. More advanced theory of partial differential 
equations then provides conditions on il under which this so-called weak 
solution u of the Dirichlet Problem can be identified with a solution of the 
standard type. 

Let CQ(n) be the space of functions u : — >■ R that have compact 

support in il and are differentiable on and let C^(n) be the space of 
functions u : — >■ R such that u is differentiable on il and Vm extends to a 

continuous function on il. Let C^(n) be the space consisting of all elements 
of C^(n) that vanish on dil, Hq{H) the completion of C^(n) with respect 
to the inner product defined by 

(u,v) = / Vu • Vu dP, 

Jn 
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and II ’ll// the corresponding norm on It is not hard to show that 

CQ(n) is dense in i?o(H) with respect to this norm, and that can be 

identified with a certain set of Lebesgue integrable real-valued functions u 
on n. ^ 

Now define a linear functional (pf on C^(f2) by 

Vfiv) = [ vfdV. 

Jn 



Applying the Cauchy-Schwarz inequality in the Hilbert space L 2 {^), we 
obtain 

l¥^/WI < ' ■ 

Hence, by Poincare’s inequality, 



\^f{v)\ < c 



||Vt;f dP 



1/2 



\H 5 



where the constant c is independent of v. Thus the linear functional /?/ is 
bounded. It therefore extends by continuity to a bounded linear functional 
Pf on see Exercise (4.2.2:10). Thus, by the Riesz Representation 

Theorem (5.3.1), there exists a unique element u of such that 

‘Pf{v) = -{v,u) (vGi?o(n)). 



For each v that has compact support in H and is twice differentiable on fl, 
we now use the elementary vector identity 

div (uVv) = Vu ■ Vv + uV'^v 



and Gauss’s Divergence Theorem to show that 



/ uW^vdV = — (v,u) + / div (mVv) dP 
Jfi Jo. 

= Pf{v)+ / uVv-ndS 

JdQ 



= / vfdV. 
JQ 



(Recall that u = 0 on the boundary of fl). This completes the proof that u 
is the weak solution that we wanted. 




6 

An Introduction to Functional 
Analysis 



...a wonderful piece of work; which not to have been 
blessed withal would have discredited your travel. 

Antony and Cleopatra, Act 1, Scene 2 

In this chapter we first discuss the Hahn-Banach Theorem, the most famous case 
of which provides conditions under which a bounded linear functional on a sub- 
space of a normed space X can be extended, with preservation of its norm, to a 
bounded linear functional on the whole of X. We then present several applica- 
tions of this theorem, some of which illustrate the interplay between a normed 
space and its dual. In Section 2 we use the Hahn-Banach Theorem to obtain re- 
sults about the separation of convex sets by hyperplanes. The last section of the 
chapter introduces the Baire Category Theorem, and includes some of its many 
applications in classical and functional analysis. 



6.1 The Hahn-Banach Theorem 

Let X be a linear space over F. If F = C, then by a complex-linear 
functional on X we mean a mapping f : X ^ C such that 

f{x + y) = f{x) + f{y) 



and 



f{Xx) = Xf{x) 
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for all x,y € X and all A € C. If / maps X into R and satisfies these 
equations for all real numbers A, then / is called a real-linear functional 
on X. 

According to our first lemma, real-linear functionals can be characterised 
as the real parts of associated complex-linear functionals. 

(6.1.1) Lemma. Let X he a complex normed linear space. If f is a 
complex-linear functional on X and u is the real part of f, then u is a 
real-linear functional on X and 

f (x) = u{x) — iu{ix) {x G X). (1) 

If u is a real-linear functional on X and f is defined by equation (1), then 
f is a complex-linear functional on X. Moreover, if f and u are related 
as in equation (1) and either f or u is bounded, then both functionals are 
hounded and ||/|| = ||w|| . 

Proof. If / is a complex-linear functional on X and tt = Re (/), then it 
is easy to show that u is real-linear; moreover, equation (1) follows from 
the fact that z = Re (z) — i Re(iz) for any complex number z. On the other 
hand, if m is a real-linear functional on X, and / is defined as in (1), then 
it is clear that f{x + y) = f{x) + f{y), and that f{\x) = Xf{x) for all real 
A. Also, 

f{ix) = u{ix) — m{i^x) 

= u{ix) — m{—x) 

= u{ix) + m{x) 

= i/(a;), 

from which it follows that / is complex-linear. 

If / is bounded, then as |M(a:)| < |/(a;)| for all x & X , u is bounded 
and 11^11 < ||/||. For each x £ X there exists A G C such that |A| = 1 and 
/(Ax) = Xf {x) = |/(x)|; then /(Ax) G R, so 

|/(x)| = /(Ax) 

= Re (/(Ax)) 

= u(Xx) 

<IH| ||Ax|| = |H| ||x||. 

Hence ||/|| < ||m||, and therefore ||/|| = ||u||. Finally, if u is bounded, then 
for all X G X with ||x|| < 1 we have 

|/(x)| < |u(x)| + |M(ix)| 

< Ikll (INI + llia^ll) 

<2 Nil, 

so / is bounded. By the foregoing, ||/|| = ||u||. □ 
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Let X be a vector space over F, and p a mapping of X into R. We say 
that p is 

• subadditive if p{x + y) < p{x) + p{y) for all x, y G X] 

• positively homogeneous if p{\x) = \p{x) for all x € X and A > 0; 

• a sublinear functional if it is subadditive and positively homogeneous; 

• a seminorm if it is nonnegative and subadditive, and if p{\x) = 
|A|p(x) for all X G Af and A G F. 

For example, if c > 0, then p{x) = c ||x|| defines a sublinear functional on 

X. 

Now let Xq be a subspace of X, /o a linear functional on Afg, and / a 
linear functional on X. We say that / extends /o to X, or that / is an 
extension of /g to X, if /(x) = /g(x) for all x G Afg. If also / is bounded 
and ll/ll = ||/g|| , we say that / is a norm-preserving extension of /g to X. 

We now prove a preliminary version of the extension theorem for linear 
functionals. 

(6.1.2) Proposition. Let X be a real normed space, Xq a subspace of 
X, xi a point of X\Xq, and X\ the subspace of X spanned by Xq U {xi}. 
Let p be a sublinear functional on X, and /g a linear functional on Xq such 
that /g(x) < p{x) for all x G Xq. Then there exists a linear functional f 
that extends /g to Xi and satisfies /(x) < p{x) for all x G Xi. 

Proof. Since xi ^ Xq, each element of Xi can be written uniquely in 
the form x + Axi with x G Xq and A G R. Let t be any real number, and 
provisionally define 

/(x + Axi) = /g(x) + At. 

It is easily shown that / is a linear extension of /g to Xi ; hence it remains 
to choose T so that 

/g(x) + At < p(x + Axi) (x G Afg, A G R\ {0}) . (2) 

To this end, replacing x by Ax, using the positive homogeneity of p, and 
then dividing both sides of (2) by |A| , we observe that (2) is equivalent to 
the two conditions 

/g(x) + T < p{x + xi) if X G Xq and A > 0, 

— /g(x) — T < p{—x — Xi) if X G Xq and A < 0. 



In turn, these two conditions can be gathered together in one: 

-p{-X - Xi) - /g(x') <T <p{x + Xi) - /g(x) (x, X G Xq). 
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But for all x, x' £ Xq we have 

fo(x) - fo(x') = fo{x- x') 

< p(x — x') 

= p{x + Xi — x' — Xi) 

< p{x + Xi) + p{ — x' — Xi) 

and therefore 



-p{-x' - xi) - fo{x') < p{x + xi) - fo{x). 

Thus in order to satisfy ( 2 ), and thereby complete the proof, we need only 
invoke Exercise ( 1 . 1 . 1 : 21 ). □ 

This brings us to the Hahn- Banach Theorem. 

( 6 . 1 . 3 ) Theorem. Let Xq be a subspace of a real normed space X , p 
a sublinear functional on X, and fo a linear functional on Xq such that 
fo{x) < p{x) for all x in Xq. Then there exists a linear functional f that 
extends fo to X and satisfies f{x) < p{x) for all x £ X. 

Proof. Let T denote the set of all linear functionals / that are defined 
on subspaces of X containing Xo and that have the following properties. 

(i) f = fo on Xo and 

(ii) f{x) < p{x) for all x in the domain of /. 

Define the binary relation ^ on . 7 ^ by inclusion: 

f 4 9 ii and only ii f C g. 

Then ^ is a partial order on T . Let C be a chain in T (that is, a nonempty 
totally ordered subset of T), and define 

G = U 3 = 2/) : 3^ G C (y = g(a;))} . 

sec 

If (x, yi) G G and (x, 92) G G, then there exist 91,92 £ C such that (x, yi) G 

91 and (x,j/2) £ 92- Since C is a chain, either gi C 92 or else, as we may 
assume, 92 C gp, then (x,t/2) G 9 i and therefore, as 91 is a function, 

92 = 9i - It follows that G is a function on X; and that if x is in the domain 
of some 9 £ C, then x is in the domain of G, G(x) = g{x), and therefore 
G(x) < p{x). It is easy to show that the domain of G contains Xo, and 
that G = fo on Xo- To complete the proof that G £ if, we must show 
that G is linear on X. To this end, given x,x' in the domain of G, choose 
g,g' G C such that (x, G(x)) G g and (x',G(x')) G g' . As C is a chain, we 
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may assume that g' C 5, so that (a;', G{x')) £ g; as g is linear, it follows 
that X + a;' is in the domain of g and therefore in the domain of G, and 
that 



G{x + x') = g{x + x') 

= 9i.x)+g{x) 

= G{x) + G{x'). 

Similarly, for each A € R, G(Ax) = AG(x). Hence G £ T. li is trivial to 
verify that G is an upper bound of C in T . 

We can now apply Zorn’s Lemma (see Appendix B) to produce a maximal 
element / of T . It only remains to show that / is defined throughout X. 
But if / is not defined at some point xq of A, then, using Proposition 
(6.1.2), we can find an element g oi T such that f ^ g and g is defined at 
xq. Since / is maximal in X, it follows that / = g, a contradiction. □ 

The name “Hahn-Banach Theorem” is often applied to the following 
corollary. 

(6.1.4) Corollary. Let Xq be a subspace of a normed space X, and /o 
a bounded linear functional on Xq. Then there exists a norm-preserving 
extension of fo to X. 

Proof. First consider the case where /o is a real-linear functional on Xq. 
Applying Theorem (6.1.3) with p(x) = ||/o|| ||x||, we obtain a real-linear 
functional / that extends /o to X and satisfies /(x) < ||/o|| ||x|| for all 
X £ X. Replacing x by — x in this last inequality, we see that 

|/(x)| = max{/(x), -/(x)} < ||/o|| |lx|| 

for all X G A; whence / is bounded, and ||/|| < ||/o|| . But / extends /o, so 
ll/ll > ll/oll and therefore ||/|| = ||/o|| . 

When / is a complex-linear functional, we apply the foregoing argument 
to construct a norm-preserving extension u of the real-linear functional 
Re(/o) to A. Lemma (6.1.1) then shows us that 

/(x) = u(x) — iu(ix) 

defines a norm-preserving extension of /o to A. □ 

(6.1.5) Exercises 

.1 Prove the complex Hahn-Banach Theorem: let Aq be a subspace of 
a complex normed space X, p a seminorm on A, and /o a linear 
functional on Aq such that |/o(a;)| < p{x) for all x G Aq; then there 
exists a linear functional / that extends /o to A and satisfies |/(x)| < 
p{x) for all X G A. (First apply the Hahn-Banach Theorem to the 
real-linear functional Re(/o).) 




264 6. An Introduction to Functional Analysis 

.2 Let AT be a separable normed space. Prove Theorem (6.1.3) without 
using Zorn’s Lemma. (Let (a;„) be a dense sequence in X, and p a 
sublinear functional on X. Starting with a given linear functional /o 
on a subspace Xq of X, extend /o inductively to the subspace Xn of 
X spanned by Xn-i U {xn} , such that the linear extension /„ to X„ 
satisfies fn{x) < p{x) for all x G AT„. Then consider / = fn-) 

The Hahn-Banach Theorem — especially in the form of Corollary (6.1.4) — 
has many interesting applications. We begin with some of the simpler 
ones. 

(6.1.6) Proposition. Let S he a closed subspace of a normed space X, 
and let xq € X\S. Then there exists a bounded linear functional f on X 
such that 

(i) /(xo) = 1 and 

(ii) f{x) = 0 for all x G S. 

Proof. Let Xq be the subspace of X spanned by S' U {xg}, and define 
/o(x + Axg) = A (x G S, A G F). 

(This is a good definition: for, as Xg ^ S, the representation of a given 
element of Xq in the form x + Axg, with x G S and A G F, is unique.) Then 
fo is a linear functional on Xq, /o(x) = 0 if x G S, and /o(xg) = 1. Now, 
as S is closed, we see from Exercise (3.1.10: 3) that p(xg, S) > 0. So for all 
X in S and all nonzero A G F, 

I|x + Axg|| = |A| ||A“^X + Xg|| > |A| p(xg, S). 

Hence 

|/(x + Axg)| = |A| < p(xg, S)“^ I|x + Axg|| , 

and therefore fo has the bound p(xg,S)“^. Applying Corollary (6.1.4) to 
fo, we obtain the desired linear functional f on X. □ 

(6.1.7) Proposition. If xq is a nonzero element of a normed space X, 
then there exists a bounded linear functional f on X such that /(xg) = ||xg|| 
and ll/ll = 1. 

Proof. Let Xq be the subspace of X generated by {xg}, define a linear 
functional /g on Xq by /o(Axg) = A||xg||, and apply Corollary (6.1.4) to 
/o- □ 



(6.1.8) Corollary. For each x in a normed space X, 
Ikll = sup{|/(x)| : f GX*, ll/ll = 1}. 
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Proof. If x = 0, the conclusion is trivial. If x ^ 0, then for all / € X* 
with ll/ll = 1 we have 



l/(^)l <11/11 INI = lkll- 

Since, by Proposition (6.1.7), there exists / G X* such that ||/|| = 1 and 
f{x) = ||x|| , the result follows. □ 

The remaining results and exercises in this section illustrate the interac- 
tion between a normed space X and its dual X * , one of the most fascinat- 
ing and beautiful aspects of modern analysis, in which the Hahn-Banach 
Theorem plays a fundamental part. 

(6.1.9) Exercises 

.1 Show that if X is a finite-dimensional Banach space, then X* is 
finite-dimensional and dim(X*) = dim(Jf). (Reduce to the case 
where X is n-dimensional Euclidean space.) 

.2 Let S' be a closed subspace of a Banach space X, and define 

S< = {/ G X* : f{x) = 0 for all x G S} . 

Prove that S< is a closed linear subspace of X* . Show that the fol- 
lowing procedure yields a well-defined mapping T of S* into X*/S<: 
given / in S*, choose a norm-preserving extension E of / to X, and 
set T f equal to the element of X*/S< that contains F. Prove that T 
is a norm-preserving linear isomorphism of S* onto X*/S<. Hence 
prove that for each F G X*, 

sup{|F(x)| : X G S, ||x|| < 1} = inf {||E- /|| : / G S<} . 

.3 Let S be a closed linear subspace of a Banach space X, let (p be 
the canonical map of X onto X/S, and for each / in (X/S)* define 
T f = fo(p. Prove that T is an isometric linear isomorphism of (X/S)* 
onto S'-*-. Hence prove that for each x G X, 

inf {||x- s|| : s G S} = sup{|/(x)| : / G S-^, |1/|| < 1} . 

.4 Let xi, . . . , x„ be elements of a real normed space X, and ci, . . . , 
real numbers. Prove the equivalence of the following conditions. 

(i) There exists / G X* with ||/|| = 1 and /(xj) = Cj for each i. 

(ii) |AiCi -l- • • • -l- A„c„| < ||AiXi -I- • • • -I- A„x„|| . 
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.5 Let AT be a normed space, and define 

m) = fix) (x € X, f € X*). 

Prove that 

(i) the mapping x i— a; is a linear isometry of X into its second dual 

(ii) X is reflexive (see Exercise (5.3.2: 3)) if and only if this mapping 
has range X**; 

(iii) if X is reflexive, then it is a Banach space. 

.6 Prove that if X is an infinite-dimensional normed space, then X* 
is infinite-dimensional, (cf. Exercise (6.1.9: 1). Suppose that X* is 
finite-dimensional, and consider the mapping x i— x defined in the 
preceding exercise.) 

.7 Let AT be a Banach space. Prove that 

(i) if X* is separable, then so is X; 

(ii) if X is separable and reflexive, then X* is separable. 

(For (i), let {/i, /2, ■ . ■} be dense in X* , and for each n choose a unit 
vector x„ such that |/n(x„)| > \ ||x„|| . Let Y be the closure of the 
subspace generated by {xi,X2,...}, suppose that F yf AT, and use 
Proposition (6.1.6) to deduce a contradiction.) 

.8 Prove that a closed subspace F of a reflexive Banach space X is 
reflexive. (For each / G X* let fy denote the restriction of / to F. 
Given u S F**, choose ^ G X such that m(/f) = /(C) for all / G X*. 
Then use Propositions (6.1.6) and (6.1.4).) 

.9 We saw in Exercise (4.4.11:5) that the space L^o, introduced in 
Exercise (4.4.11:2), can be identified with the dual space of Li = 
Li(R).In this exercise we show that Li can be identified with a 
subset of the dual of L^o but is not the whole of that dual. 

Prove that for each g G Li, 



Ugif) = J ^9 

defines an element of the dual of Loo- Let Xq be the set of all con- 
tinuous functions / : R — >■ R that vanish outside some compact set, 
and define a bounded linear functional uq on Xq by uq(/) = /(O)- 
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Using Corollary (6.1.4), construct a norm-preserving extension u of 
uo to Loo- By considering u{fn), where 

r (i-|x|)" if |x|<i 

fn{x) = < 

[ 0 if |x| > 1, 

show that there is no element g of Li such that u = Ug. 

It follows from this exercise that, in contrast to Lp for 1 < p < oo 
(see Theorem (4.4.10)), Li is not reflexive. 

The next three lemmas, together with our work on the Hahn-Banach 
Theorem, enable us to produce a substantial strengthening of the following 
consequence of Riesz’s Lemma (4.3.5): in an infinite-dimensional normed 
space, if 0 < 0 < 1, then there exists a sequence (x„) of unit vectors such 
that \\xm — a^nll > 0 whenever m ^ n. 

(6.1.10) Lemma. ///, /i, . . . , /„ are linear functionals on a linear space 
X over F such that ker(/) D nr=i O'Xe linearly 

dependent. 

Proof. We may assume that none of the functions under consideration 
is identically zero. We proceed by induction on n. In the case n = 1, choose 
a £ X such that /i(a) = 1. Then for each x £ X, 

{x- fi{x)a) £ ker(/i). 



SO 

0 = f{x- fi{x)a) = f{x) - f{a)fi{x). 

Hence / = /(a)/i, and therefore / and fi are linearly dependent. 

Now suppose that the lemma holds for n = k, and consider the case 
n = k + 1. Let g be the restriction of / to ker(/fc+i), and for i = 1, . . . , fc 
let gi be the restriction of fi to ker(/fe+i). Then ker((/) D rii=i so 

g = ^i9i for some elements Xi of F, by our induction hypothesis. 

Thus / — ^i^iXifi vanishes on ker(/fe+i). By the case n = 1 that we 
have already proved, / — ^ifi /fe+i &re linearly dependent; so 

/, /i, . . . , fk+i are linearly dependent and the induction is complete. □ 

(6.1.11) Lemma. Let X be an infinite- dimensional normed space, and 
/i, . . . , /„ elements of X*. Then ker(/i) yf {0} . 

Proof. First assume that the fi are linearly independent. By Exercise 
(6.1.9: 6), there exists an element / of X* such that /, /i, . . . , /„ are linearly 
independent. Lemma (6.1.10) now shows that Hfci kor(/i) is not contained 
in ker(/), from which the desired conclusion follows immediately. 
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Now consider the case where the fi are linearly dependent. Without 
loss of generality, we may assume that for some m < n, {/i, . . . , is 
a basis for the linear space generated by all the fi. By the first part of 
the proof, there exists a nonzero element ^ in H^i clearly, ^ G 

n”=iker(/ 0 - □ 

(6.1.12) Lemma. Let X be an infinite-dimensional normed space, and 

linearly independent elements of X*. Then there exist nonzero 
elements ^,77 of X such that ffirf) < 0 = ffifi) for each i. 

Proof. The existence of ^ follows from Lemma (6.1.11). On the other 
hand. Lemma (6.1.10) shows that for each i there exists Xi G X such that 
fi{xi) = 1 and fj{xi) = 0 when j i. Setting rj = — ^kat 

fi{r]) = —1 for each i. □ 

(6.1.13) Proposition. If X is an infinite- dimensional normed space, 
then there exists a sequence (x„) of unit vectors in X such that \\xm — Xn\\ 
> 1 whenever 



Proof. We construct the required vectors inductively as follows. Choos- 
ing a unit vector x\ G X, apply Proposition (6.1.7) to obtain fi G X* 
such that ll/ill = 1 = fi{xi). Now suppose that we have constructed unit 
vectors X\, . . . ,Xn in X, and linearly independent unit vectors /i, . ■ . , /n in 
X*, such that ffixi) = 1 = ||/i|| for each i. By Lemma (6.1.12), there exist 
nonzero elements ^,77 of X such that ffiif) < 0 = ffif) for each i. Choose 
c > 0 such that HryH < ||?7 -I- c^|| . Setting 

Xn+l = ll?? + cCir^ {ll+cfi), 



note that /i(x„+i) < 0 for 1 < z < n. Now use Proposition (6.1.7) to obtain 
an element fn+i of X* such that ||/„_|_i|| = 1 = /„+i(a;„+i). Suppose that 
/n+i for some elements \ of F. Then 



h + cCII 



fn+liv + cf) 

n 

>^ifiiv + c^) 

n 

\fn+l{v)\ 



< \\V+C^\\, 

a contradiction. Hence the linear functionals fi, ■ ■ ■ , fm fn+i are linearly 
independent. Moreover, if 1 < z < n, then 

li^n-t-l X%\ ^ \fi(^Xn-\-l ^z)| — |/*i(^n-t-l) fii.Xi)\ ^ 1 , 
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since fi{xi) = 1 and fi{xn+i) < 0. This completes our inductive 
construction. □ 

A sequence (a:„) in a Banach space X is called a Schauder basis if for each 
X € X there exists a unique sequence (An) in F such that x = ^nXn- 

In that case, X is separable, and the mapping x i->. can be used to 

identify X with a sequence space. 

The notion of a Schauder basis generalises that of a basis in a finite- 
dimensional space. In the spaces cq and Ip {1 < p < oo) let e„ be the 
vector with nth term equal to 1 and all other terms 0 ; then {ei, 62 , . . .} is 
a Schauder basis. Schauder bases for other separable Banach spaces, such 
as C[0, 1], are not so easy to construct, and Enflo [15] has shown that there 
exist separable Banach subspaces of Cq that do not have a Schauder basis. 
We can, however, prove the following theorem. 

(6.1.14) Theorem. Every infinite- dimensional Banach space contains 
an infinite-dimensional closed subspace with a Schauder basis. 

The next two lemmas make this possible. 

(6.1.15) Lemma. Let (Xn) be a total sequence of nonzero elements of a 
Banach space X, and c a positive number such that z/ Ai, . . . , A„ belong to 
F, and m < n, then 

m n 

^ ^ ^iXi 'fi: 0 ^ ] A^Xz . (3) 

i=l i=l 

Then (xi) is a Schauder basis for X. 

Proof. Consider any sequence (A„) in F such that converges 

in A. If n > k, then 

|Afc| = ||xfc||“^ IIAfcXfcll 

n 

<ciixfcir^ y^^XjXj 

i=k 

Letting n — >■ 00 , we see that 

00 

|Afc|<c||xfc||“^ y^XjXj 
i=k 

A simple induction argument now enables us to prove that if = 

0, then Ai = 0 for each i. Thus a given element of X has at most one 
representation in the form with each Xi in F. It remains to show 

that such a representation exists. 
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Let Xoo be the subspace of X generated by {xi,X 2 , • ■ , and for each n 

let Xn be the subspace generated by {x \, . . . , x„}. Define a (clearly linear) 
mapping P„ of X^ onto X^ by 

( oo \ n 

^ ^ I — ^ ( AjX^. 

i=l / 

It follows from (3) that c is a bound for P„ on X^o- But X^o is dense in 
X, so, by Exercise (4.2.2: 10), extends to a bounded linear mapping P„ 
on X with bound c. By Corollary (6.1.4), the mapping A^a^i '— >■ A„ 
extends to a bounded linear functional /„ on X such that 



where, for convenience, we set Po{x) = 0. 

We prove that x = fn{x)xn for each x G AT. To this end, let e > 0 

and, using the fact that the sequence (x„) is total, choose Ai, . . . , Aat in F 
such that 



X — 




n—1 



< S. 



For each k > N we have 



\\x-Pk{x)\\ < 



N 

X ^ ^ X^Xji 


+ 


N 

^ ^ XjiX'fi 


n—1 




n—1 






N 



Pk f ^ ^ • 



< £ + ||Cfc|| 



n—1 

N 






n—1 



< (1 + c)e. 



Hence 

k oo 

X = lim PkX = lim /„(x)x„ = /n(x)x„. □ 

k—¥oo k—^cx) ' ^ ^ 

n—1 n—1 

(6.1.16) S. Mazur’s Lemma. Let Y he a finite-dimensional subspace 
of an infinite-dimensional Banach space X. Then for each £ > 0 there 
exists a unit vector f € X such that 

||j/||<(l + ^)||2/ + AC|| (4) 



for all y GY and A G F. 
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Proof. Without loss of generality we may take £ < 1. Let {yi, 
be an e/2-approximation to the set 

^ = {y G : bll = 1} 

(which is compact, by Exercise (4.3.7: 2)). Using Proposition (6.1.7), for 
i = l,...,n construct fi £ X* with norm 1 such that fi{yi) = 1. By 
Lemma (6.1.11), there exists a unit vector ^ G nr=i Consider any 

vector y £ Y and any A G F. If y = 0, then (4) is trivial. If y yf 0, then we 
may assume that ||y|| = 1: otherwise, we just consider ||y|| y. Choosing i 
such that ||y — yi\\ < e/2, we have 

||y + A^ll > \\y^ + A^ll - ||y - y/| 

> h{yi + K) - I 
= 1 - £ 

1 



since £ < 1. Hence (4) obtains. □ 

Proof of Theorem (6.1.14). Let X be an infinite-dimensional Banach 
space, and e > 0. Choose positive numbers £„ such that 

In (1 + £„) < 2“”“^ In (1 + e) 

for each n. Then 

OO N 

n (l + £«) = lim TT (1 + £n) ^ Vl + £ < 1 + £. 

n—1 n—1 

Let Xi be a unit vector in X. By Mazur’s Lemma, there exists a unit vector 
X 2 £ X such that 

II 2 /II < (l + £i) ||y + Axall 

for all y in the subspace generated by and for all A G F. By the same 
lemma, there exists a unit vector X 3 £ X such that 



II 2 /II < (1 + £2) ||y + Axall 



for all y in the subspace generated by {a;i,X2} and for all A G F. Carrying 
on in this way, we construct an infinite sequence (x„) of unit vectors in X 
such that 

||y|| < (1 + £n) lly + Ax„+i|| 

for all y in the subspace generated by {xi, . . . ,x„} and for all A G F. It 
follows that if Ai , . . . , A„ G F and m < n, then 



m 

2=1 



< (1 + Sm) 



m 

^ ^ A 77), _(_lX 777,-1-1 

2=1 
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m+1 



2=1 



< (1 + ^m)(l ^m+l) 

< * 

< (1 + £m)(l + ^m+l) * * * (1 + £n-l) 



^ ^ H“ A77T,-|_2XyjT, + 2 

n 

^ ^ A^Xj 



<(l + e) 






Hence, by Lemma (6.1.15), {xn)'^=i is a Schauder basis of the closure of 
the subspace of X that it generates. □ 



For our last application of the Hahn-Banach Theorem in this section, 
we show that if / is a compact interval, then the dual space C(/)* can be 
isometrically embedded in the Banach space {BV{I), IMIbv) functions of 
bounded variation on / (introduced in Exercise (4.5.2: 4)). To this end, for 
convenience we say that a bounded function /:/—>■ R is representable if 
there exists an increasing sequence (/„) of elements of C{I) that converges 
simply to /. We denote by TZ{I) the subspace of B{I) consisting of all 
bounded real-valued functions on / that can be written as the difference 
of two representable functions. Note that C{I) C 

(6.1.17) Exercises 

1. Prove that if J is a compact subinterval of I, then ~xj is 
representable. 

.2 Let / G C(/), where / = [a, h], let P = {xq, ■ ■ ■ , x„) be a partition of 
/, and for each k {0 < k < n — 1) let be any point of [xk,Xk+i]. 
Define ip G B(I) by 

n— 1 

^ = E/(^'=)(>^[a, ,,,])• 

k=0 

Show that 11/ — f/ll — >■ 0 as the mesh of P tends to 0. 



(6.1.18) Theorem. Let I = [a, b] be a compact interval. Then for each 
real-valued function a of bounded variation on I, 

ua= f{x)da{x) 

J a 

defines a bounded linear functional, with norm Ta{a,b), on the Banach 
space C{I). Moreover, each bounded linear functional onC{I) is of the form 
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Mof, where a is a function of hounded variation on I that is unique up to 
an additive constant. 



Proof. Throughout this proof, P = (a;o,a;i, ■ ■ ■ ,Xn) is a partition of I, 
and for each i, is any point of the interval [xi, Xi+\\. Consider first a real- 
valued function a of bounded variation on I. The linearity of Ua follows 
from Exercise (1.5.16: 4). For each / G C(I) we have the following inequality 
for Riemann-Stieltjes sums: 



n—1 



f(fi)(a:(x^+l) - a(xi)) 



2 = 0 



n—1 



\a{x,+i) - a{xi)\ 



i=0 



< _ 

< 11/11 Ta{a,b). 

In the limit as the mesh of the partition tends to 0 we obtain the inequality 

\ua{f)\<\\f\\Ta{a,b), 

which shows that the linear functional Ua has bound To,{a,b). 

Now consider any bounded linear functional u on C{I). By Corollary 
(6.1.4), there exists a norm-preserving extension of u to Referring 

to Exercise (6.1.17: 1), define a function a : J — > R by 

a(a:) = {x G I) . 

To show that a is of bounded variation on I, let P be as in the foregoing, 
and for each fc (0 < fc < n — 1) let 

CTfc = sgn {a{xk+i) - ct{xk)) ■ 

n—1 



Then 






||(()|| < 1, and 



E \oi{xk+i) - a{xk)\ = u^{4>) < 



|u*|| = IImII 






Hence a is of bounded variation on /, and 

Ta{a,b) < ||u|| . 

If, now, / is any element of C(/), consider the function 



n—1 









( 5 ) 
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which, again by Exercise (6.1.17: 1), belongs to ??.(/). We have 



n—1 

“(/) “ X! (,a{xk+i) - a{xk)) 



= |w*(/) - U*(V') 
<NI ll/-V'l|. 



Letting the mesh of the partition P tend to 0, we see from Exercise 
(6.1.17: 2) that |1/ — ij)\\ — 1 - 0; also, 



n—1 



/c=0 



a{xk)) ^ f f{x)da{x). 



Hence u{f) = Ua{f)- Moreover, from (5) and the first part of the proof, 
||u|| = Ta{a,b). 

Finally, the uniqueness, up to an additive constant, of the function a 
corresponding to the given bounded linear functional u on C{I) follows 
from Proposition (1.5.19). □ 



The full power of the Hahn-Banach Theorem is not needed to prove 
Theorem (6.1.18): for, as is shown on pages 106-110 of [40], it is possible 
to construct an extension of u to TZ{I) by elementary means. 

We say that a function /:/—>■ R of bounded variation on / = [a, b] 
is normalised if /(a) = 0. It is easy to show that the normalised elements 
form a closed, and therefore complete, linear subspace of the Banach space 

(ev(/),|MlbJ- 

(6.1.19) Corollary. Under the hypotheses o/Theorem (6.1.18), C(/)* is 
isometrically isomorphic to the Banach space of normalised functions of 
bounded variation on I. 



(6.1.20) Exercises 

.1 Let I = [a,b] be a compact interval. Prove that the normalised el- 
ements of BV{I) form a Banach space relative to the norm . 

Then prove Corollary (6.1.19). 

.2 Compute the norm of the bounded linear functional u defined on 
by 

°° f— 11" 

«(/) = E ^/(Vn). 

Tl 

n—1 

.3 Let Ai be a compact metric space, and u a linear functional on C{X) 
that is positive, in the sense that u{f) > 0 for all nonnegative / G 
C{X). Prove that u is bounded and has norm equal to u(l), where 1 
is the constant function x I on X. 
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.4 Let M be a bounded linear functional on C{X), where X is a com- 
pact metric space. Prove that there exist positive linear function- 
als v,w on C{X) such that u = v — w. (For / > 0 in C{X) let 
v{f) = sup {m( 5 ) : g G C{X), 0 < g < f} .) 



6.2 Separation Theorems 

In this section we use the Hahn-Banach Theorem to establish a number 
of geometric results about the separation of convex sets by a hyperplane. 
These results have many applications, including some significant ones in 
mathematical economics (see Appendix C) . 

If A is a subset of a vector space, and t G F, we define 

tA = {tx : X G A} . 



(6.2.1) Lemma. Let X he a normed space, and A a convex subset of X 
containing 0 in its interior. Then the Minkowski functional p : A — >■ R, 
defined by 

p{x) = inf {t > 0 : X G tA } , 

is a sublinear functional on X. If p{x) < 1, then x G A; and if A is open, 
then 

A = {x G A : p(x) < 1} . 



Proof. Choose r > 0 such that B{0,r) C A. If x yf 0, then 



«adB(0,,)cdH.4. 

r r 

It follows that p is defined throughout A. Let a, (3 he positive numbers 
such that X G aA and y G /3A; then 



X + y = {a + P) 
where, by convexity. 



ex P 



a ^x -I- 



P 



rT ) , 



cx /3 



a 



oc /3 



Of ^x -I- 



p 



(X P 



P y(3A. 



So X + y G (a -I- P)A. It now follows that p{x + y) < p{x) + p{y). On the 
other hand, if A > 0, then for all positive t we have 



and therefore 



Xx G t A ^ X G (a ^t) A 
t > p{\x) X~^t > p(x), 
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SO p{Xx) = Xp{x). This last equation also holds when A = 0, since p(0) = 0. 
Thus p is a sublinear functional on X. 

If p{x) < 1, then there exists t G (0, 1) such that t~^x G A; by the 
convexity oi A, x = {1 — t)0 + t{t~^x) belongs to A. 

Finally, suppose that A is open, and consider any x G A. Since p(0) = 0, 
to prove that p{x) < 1 we may assume that a; yf 0. Choose s > 0 such that 
B {x, s ||a;||) C A] then (1 + s)x G A, so p{x) < (1 + s)“^ <1. □ 

(6.2.2) Lemma. Let A he a nonempty open convex subset of a normed 
space X, and xq a point of X\A. Then there exists a hounded real-linear 
functional f on X such that f{x) < f{xo) for all x in A. 



Proof. By translation, we may assume that 0 G A; so, by Lemma (6.2.1), 
p{x) = inf {t > 0 : t~^x G A} 

defines a sublinear functional on X , and p{x) < 1 if and only if cc G A. 
Hence p{xo) > 1. Let Xq be the real linear subspace of X generated by 
{a;o}, and define a bounded real-linear functional /o on Xq by 

fo{Xxo) = X (AgR). 



If A > 0, then 



fo{Xxo) = A < Xp{xo) = p{Xxo); 



if A < 0, then 



/o(Aa;o) = A < 0 < p{Xxq). 



Thus fo{x) < p{x) for all x G Xq. By the Hahn-Banach Theorem (6.1.3), 
there exists a real-linear functional f on X such that 



• f{x) = fQ{x) for all X G Xq, and 

• f{x) < p{x) for all X G X. 



For all X G A, 

f{x) < p{x) < 1 = f{xo). 

It follows that the nonempty open set A is contained in the complement 
of the translated hyperplane Xq + ker(/); whence, by Exercise (4.2.5: 3) 
and Lemma (4.1.4), the hyperplane ker(/) is closed in X. It follows from 
Proposition (4.2.3) that / is bounded. □ 

(6.2.3) Proposition. Let C he a nonempty closed convex subset of a 
normed space X, and xq a point of X\C . Then there exist a hounded real- 
linear functional f on X, and a real number a, such that f{x) < a < /(xq) 
for all X G C. 
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Proof. Choose r > 0 such that B{xq, r) fl C = 0. By Exercise (4.1.5: 6), 
A= {x + y : X G C, y G B{0, r)} 

is open and convex; also, xq ^ A. By Lemma (6.2.2), there exists a bounded 
real-linear functional f on X such that f{x) < f{xo) for all x in A. Since / 
is not identically 0, f{b) > 0 for some b G B{0, r). Taking a = /(xq) — f{b), 
we see that for all x G C, 

f{x) = f{x + b) - f{b) < a < f{xo). □ 

(6.2.4) Exercises 

.1 Let be a compact convex subset of a real normed space X, and 
B a closed convex subset of X. Prove that there exist / G X* and 
a,/d G R such that f(x) < a < P < f{y) for all x G A and y G B. 

.2 Prove Belly’s Theorem: let .7^ be a finite family of convex subsets of 
R" with the property that the intersection of any n + 1 sets in T is 
nonempty; then P| .7^ is nonempty. (First use induction on the number 
of sets in then use induction on the dimension n.) 

.3 Let K he & convex subset of a normed space X, and S C K. We say 
that S is an extreme subset of K if, for any distinct points x,y oi K 
such that \{x + y) G S', we have x G S and y G S. If also S contains 
only one element, then that element is called an extreme point of K. 

Prove that the intersection of any family of extreme subsets of K is 
either empty or an extreme subset. 

Now suppose that K is also compact, and let S be the family of all 
extreme subsets of 7G, partially ordered by inclusion. Prove that S has 
a minimal element Sq. (Use the finite intersection property and Zorn’s 
Lemma.) Then prove that So consists of a single point. (Suppose that 
So contains two distinct points ^,77. Choose / G X* such that /(^) < 
/(?7), and let a = sup,,,^;^' f{x)- Show that Si = {x G So : /(x) = a} 
is an extreme subset of K such that So\Si yf 0.) 

Finally, prove that K has at least one extreme point. 

.4 By the convex hull of a subset ST of a normed space we mean the set 
of all elements of the form where the Xj are elements of K 

and the Xi are nonnegative real numbers such that = 1- Prove 

the Krein-Milman Theorem: a compact convex subset of a normed 
space X is the closure of the convex hull of the set of its extreme 
points. (Let C be the convex hull of the set of extreme points of the 
compact convex set, let xq G X\C, and apply Proposition (6.2.3).) 
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When X = R”, there is a weak extension of Proposition (6.2.3) to the 
case where C need not be closed. 

(6.2.5) Proposition. Let C he a nonempty convex subset of the Eu- 
clidean space R”, and xq a point of R”\C. Then there exists a hounded 
real-linear functional f on R" such that f{x) < f{xo) for all x € C. 

Proof. Since C is nonempty, closed, and convex. Proposition (6.2.3) al- 
lows us to assume that Xq G C\C. Then, by Exercise (4.1.5: 7), each open 
ball with centre xq contains some point of the complement of C. Choose a 
sequence (xfc) in R" that converges to xq, such that Xk ^ C for each k. By 
Proposition (6.2.3) and Theorem (5.3.1), for each k there exist pk G R" 
and Ofc G R such that 

• (x,pk) < Ofe for all X G C, and 

• (xk,Pk) = Oik. 

Replacing pk by ||pfe|| ^ Pk, we may assume that ||pfe|| = 1 for each k. 
Since the unit ball of R” is compact (Theorem (4.3.6)), we may pass to 
a subsequence and assume that {pk) converges to a limit p in R"; then 
IIpII = 1. Also, as 

Wk\ < Ibfcll Ibfcll = lixfcll 

and the sequence (||xfc||) , being convergent, is bounded, (a^) is a bounded 
sequence in R. Passing to another subsequence, we may further assume 
that («fc) converges to a limit a in R. By continuity, for all x G C we have 

(x,p) <a = (xo,p) . 

It remains to take /(x) = (x,p). □ 

Let H he & hyperplane in the normed space X, and a an element of X\H. 
By Propositions (4.2.4) and (6.1.1), for each a G R there exists a unique 
real-linear functional f on X such that 

a-\- H = {x € X ■. /(x) = a} . 

We say that the translated hyperplane a-\- H separates the nonempty sub- 
sets A and R of AT if /(x) < a for all x £ A, and /(x) > a for all 
X £ B. 

(6.2.6) Minkowski’s Separation Theorem. Let A and B be dis- 
joint nonempty convex subsets o/R”. Then there exists a closed translated 
hyperplane that separates A and B. 

Proof. The nonempty set 

C = B — A={x — y: x£B,y£A} 
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is convex, and 0 ^ C. By Proposition (6.2.5), there exists a bounded real- 
linear functional / on R" such that f{z) > /(O) = 0 for all z G C. Hence 
f{x) > f{y) for all X G A and y G B, and we need only apply Exer- 
cise (1.1.1:21) to obtain the required real number a. The corresponding 
hyperplane f~^ ({«}) then separates A and B. □ 

(6.2.7) Exercise 

Let A, B be disjoint nonempty convex subsets of a normed space 
X such that A is compact and B is closed. Prove that there exist a 
bounded real-linear functional / on X, and a real number a, such that 
f{x) > a for all x G A, and f{x) < a for all x G B. (Reduce to the 
case where A = {0}, note Exercise (4.1.5: 6), and apply Proposition 
(6.2.3).) 

6.3 Baire’s Theorem and Beyond 

In this section we prove one of the most useful theorems about complete 
metric spaces, Baire’s Theorem, and then study several of its many interest- 
ing consequences. Among these are the existence of uncount ably many con- 
tinuous, nowhere differentiable functions on [0, 1], and the Open Mapping 
Theorem for bounded linear mappings between Banach spaces. 

(6.3.1) Baire’s Theorem. The intersection of a sequence of dense open 
sets in a complete metric space is dense. 

Proof. Let A be a complete metric space, (C/„) a sequence of dense open 
subsets of X, and 

OO 

U=P\Ur,. 

n—1 

We need only prove that for xq G X and tq > 0, the set U 0 B{xo,ro) 
is nonempty. To this end, since U\ is dense in X, we can find x\ in 
U\ nR(a;o,ro). Moreover, since both U\ and B{xo,ro) are open, so is their 
intersection; whence there exists ri such that 0 < ri < 1 and 

B{xi,ri) C 17i n B{xo,ro). 

Since U 2 is dense in X, we can now find X 2 in U 2 O B{xi,ri); but U 2 O 
B{xi,ri) is open, so there exists T 2 such that 0 < r 2 < 1/2 and 

B{x 2 , T 2 ) C 1/2 n B{xi,ri). 

Carrying on in this way, we construct a sequence (a;„) of points of X, and a 
sequence (r„) of positive numbers, such that for each n > 1, 0 < r„ < 1/n 
and 

B^Xjij r n) GL UjiH B(^Xji—ij Xn—lf 
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By induction, \im>n, then Xm G B{xn,rn)', whence 

p{Xm,Xn) < Tn < ^ {m > u) . (1) 

Thus {xn) is a Cauchy sequence in X. Since X is complete, (a;„) con- 
verges to a limit Xao in X . Letting m tend to oo in inequality (1), we have 
p{xoo,x„) < r„, and therefore Xoo G for each n. Taking n = 0, 

we see that Xao G B{xo,ro); taking n > 1, we see that Xoo G C/„. □ 

The alternative name Baire Category Theorem for Theorem (6.3.1) 
originates from the following definitions (due to Baire). 

A subset S' of a metric space X is said to be 

• nowhere dense in X if the interior of S is empty; 

• of the first eategory if it is a countable union of nowhere dense subsets; 

• of the second category if it is not of the first category. 

Baire’s Theorem is equivalent to the statement a nonempty complete metric 
space is of the second category. 

(6.3.2) Exercises 

.1 Prove the last statement; more precisely, prove that if a nonempty 
complete metric space is the union of a sequence of closed sets, then 
at least one of those closed sets has a nonempty interior. 

.2 Prove the extended version of Cantor’s theorem on the uncountability 
of R (Exercise (1.2.11: 4)): if (x„) is a sequence of real numbers, then 
{x G R : Vn (x Xn)} is dense in R. 

.3 Prove that a nonempty complete metric space without isolated points 
is uncountable. 



We now show how Baire’s Theorem can be used to prove the existence 
of continuous functions on J = [0, 1] that are nowhere differentiable on I. 

For each positive integer n let be the set of all / G C{I) with the 
property: 

there exists t G [0, 1 — n“^] such that \f{t + h) — f{t)\ < nh 
whenever 0 < h < 1 — t. 



Note that U^i Bn contains any / G C{I) such that for some t G [0, 1) the 
right-hand derivative of / at t, 



f{t+) 



h 



lim 



f{t+h) - f{t) 



h 
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exists. To see this, consider such / and t. Choose a positive integer m such 
that t € [0, 1 — nj"^] and \ < n\. Next choose ho > 0 such that if 

0 < h < ho, then \f{t + h) — f{t)\ < nih. If ho = I — t, set n = ni. If 
ho <f — t, then for ho < h < 1 — t we have 

\f{t + h)-f{t)\<^^h, 
no 

where H-H denotes the sup norm on C(I); so, taking n = max {ni, 712 }, 
where the positive integer rz 2 > 2 ||/|| /ft-o, we have / G i?„. 

We prove that C(/)\if„ is dense and open in I. To this end, first let 
(/fe)fci be a sequence in that converges to a limit / in C{I). Then there 
exists a sequence (tk) in [0, 1 — n~^] such that 

\fk{tk + h) - fk{tk)\ < nh 

whenever fc > 1 and 0 < h < 1 — tk- Since [0, 1 — n“^] is sequentially 
compact, we may assume without loss of generality that (tk) converges to 
a limit t £ [0,1 — n“^].IfO</i<l — t, then for all sufficiently large k we 
have 0 < h < 1 — tk and therefore 

\f{t h) — f {t)\ < \f{t + h) — f {tk + h) \ + \f{tk + h) — fk{tk + h)\ 

+ \fk{tk + h) — fk{tk)\ + \fk{tk) — f{tk)\ 

+ \f{tk) - f{t)\ 

< \f{t + h)~ f{tk + h)\ + \\f - fk\\ + nh 
+ 11/ - fk\\ + \f{tk) - f{t)\ ■ 

Letting fc — >■ 00 and using the continuity of /, we obtain 

\f{t + h) - f{t) \ < nh. 

Hence / G if„, and therefore is closed in C{I). Thus C{I)\En is open 
in C{I). 

Given / G C(J) and e > 0, we now use the Weierstrass Approximation 
Theorem (4.6.1) to construct a polynomial function p such that ||/ — p\\ < 
e/2. Choosing a positive integer 

N > e~^ (n+ lb'll) , 

define a continuous function g : [0, 1] — 1 R such that for 0 < fc < iV — 1, 
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and q is linear on each of the intervals 



■ k 


k + ^ 




_]V’ 


N 


•) 



N 



k + I 
N 



Let g = p + q £ C{I). For each t G [0, 1) we have 

\g'{t~^)\ > \q'{t+)\ - \p'{t)\ >Ne- lip'll > n, 
so g ^ En- Since 

||/-ff|| < ||/-p|| + Ikll < e, 
we conclude that C{I)\En is dense in C{I). 

Now let En be the set of all f £ C{I) with the property: 

there exists t £ such that jf(t + h) — f(t)j < nh 

whenever 0 < h < t. 



Arguments similar to those just used show that C{I)\Fn is dense and open 
in C{I), and that it contains any / £ C{I) such that for some t £ (0, 1] the 
left-hand derivative of / at t, 



nn 



lim 

h — yO,h<C.O 



f{t + h) - f{t) 
h 



exists. Let 

OO OO 

^ = U u U F„. 

n—1 n—1 

Since C{I) is complete (Proposition (4.5.4)), we see from Baire’s Theorem 
that 

OO OO 

C{i)\s = n (C(/)\if„) n f| (C(J)\F„) 

n—1 n—1 

is dense in C{I). Clearly, C{I)\S consists of continuous, nowhere differen- 
tiable functions on I. 



(6.3.3) Exercises 

.1 Prove that [0,1] cannot be written as the union of a sequence of 
pairwise-disjoint closed sets. (Suppose that there exists a sequence 
(En) of pairwise-disjoint closed sets whose union is [0, 1] . Show that 
the union of the boundaries of the sets En is closed and has an empty 
interior.) 

.2 Let X be a Banach space, and C a closed convex subset of X that is 
absorbing — that is, for each x £ X there exists t > 0 such that tx G C. 
Prove that 0 does not belong to the closure of A1\C'. (Suppose the 
contrary, and show that for each positive integer n the complement 
of nC is dense and open in A".) 
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.3 Prove that if a Banach space is generated by a compact set, then it 
is finite-dimensional. 

.4 Let X be a complete metric space, and a family of continuous 

mappings of X into R. Suppose that for each x € X there exists 
Mx > 0 such that \fi{x)\ < for all i G I. Prove that there exist 
a nonempty open set E G X and a positive integer N such that 
\fi{x)\ < N for alH G / and all x £ E. (Let 

Cn,i = {x £ X ■. \fi{x)\ < n} , 

Cn = 

and use Baire’s Theorem.) 

A mapping / between metric spaces X and Y is called an open mapping 
if f{S) is an open subset of Y whenever S is an open subset of X. 

(6.3.4) Exercise 

Prove that a linear mapping T between normed spaces X, Y is open 
if and only if there exists r > 0 such that B{Q,r) C T{B{Q, 1)). 

We now aim to apply Baire’s Theorem to prove the following fundamental 
result on linear mappings between Banach spaces. 

(6.3.5) The Open Mapping Theorem. A bounded linear mapping of 
a Banach space onto a Banach space is open. 

The next lemma prepares us for the proof of this theorem. 

(6.3.6) Lemma. Let T he a linear mapping of a Banach space X into a 
normed space Y . Then T is open if and only if there exists r > 0 such that 
B{0,r)cT(B{0,l)). 

Proof. Suppose that such a real number r exists. In view of the pre- 
ceding exercise, it suffices to prove that B{0,r/2) C T(B(0, 1)). Given 
y £ B(0,r/2), since 2y £ B(0,r), we can find an element xi of the unit 
ball of X such that 

\\2y-Tx4 < §. 

So 2^y — 2Tx\ £ B(0,r), and therefore there exists X 2 in the unit ball of 
X such that 

\\2^y - 2Txi - Tx 2 \\ < §. 

Carrying on in this way, we construct a sequence (x„) of elements of the 
unit ball of X such that 

pNy _ 2 ^-^Txi - 2^-^Tx2 Txn\\ < i 
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for each N. Thus 

N 

n—1 

and therefore the series 2~^Txn converges to y. Since 

k k 

E2-"ikii<E2-” 

n=j n=j 

whenever k > j, we see from Exercise (4.1.8: 2) that 2“”a;„ converges 

to an element x in the unit ball of X . The boundedness of T now ensures 
that 

OO 

Tx=J 2 ‘^~"TXn = y. 

n—1 

Hence 5(0, r/2) C T(5(0, 1)), and therefore T is open. 

The converse is trivial. □ 

Proof of the Open Mapping Theorem. Let T be a bounded linear 
mapping of a Banach space X onto a Banach space Y. Then 

OO 

r = T{X) = U r(5(0,n)), 

n—1 

where each of the sets T (5(0,n)) is closed in Y. By Exercise (6.3.2: 1), 

there exists a positive integer N such that T{B {0, N)) has a nonempty 
interior; so there exist yi G Y and R> 0 such that 

B{y,,R)cT(B{0,N)). 

Setting 2 = N~^yi and r = N~^R, we easily see that 

5(0,r) CT(5(0,1)). 

So if y GY and ||y|l < r, then 

z±yGT(B (0,1)) 

and therefore 

y=l{{z + y)-{z- y)) G T{B (0, 1)). 

Hence 

5(0,r) CT(5(0,1)), 

and therefore, by Lemma (6.3.6), T is open. □ 
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The Open Mapping Theorem is one of a number of closely interrelated 
results. 

(6.3.7) Banach’s Inverse Mapping Theorem. A one-one bounded 
linear mapping of a Banach space onto a Banach space has a hounded 
linear inverse. 

Proof. Let T be a one-one bounded linear mapping of a Banach space X 
onto a Banach space Y . It is routine to prove that T~^ is a linear mapping 
from Y onto X. By the Open Mapping Theorem (6.3.5), if U is an open 
subset of X, then 

(y_i)_i(^) ^ 

is open in F; so T~^ is continuous, by Proposition (3.2.2), and is therefore 
a bounded linear mapping, by Proposition (4.2.1). □ 

By the graph of a mapping f : X ^ Y we mean the subset 
G{f) = :x&X} 

oi X X Y . (The graph of / is really the same as the function / itself, 
regarded as a set of ordered pairs.) 

(6.3.8) The Closed Graph Theorem. A linear mapping of a Banach 
space X into a Banach space Y is hounded if and only if its graph is a 
closed subset of X xY. 

Proof. Let T be a linear mapping of X into Y. It is a simple exercise to 
show that if T is bounded, then its graph is a closed subset of X x Y. 

Suppose, conversely, that G{T) is closed in X x F. Since X xY, a product 
of complete metric spaces, is complete (by Proposition (3.5.10)), we see 
from Proposition (3.2.9) that Q (T), which is clearly a subspace of X x F, 
is a Banach space. Define a mapping H of G{T) onto X by 

H{x,Tx) = X {x £ X). 

It is straightforward to show that H is one-one and linear. Also, 

||iL(x,ra;)||<||x|| + ||Tx|| 

< 2max{||a;|| , |lTa;||} 

= 2||(a:,Tx)||, 

so H is bounded. It follows from Banach’s Inverse Mapping Theorem (6.3.7) 
that H~^ is a bounded linear mapping of X onto G(T); but 

\\Tx\\<\\{x,Tx)\\ = \\H-^x\\<\\H-^W 11x11 
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for all X G X, and so T is bounded. □ 

We met the following result — the Uniform Boundedness Theorem — in 
Exercise (4.2.2: 14), where you were asked to fill in the details of a relatively 
little known elementary proof. We now place the Uniform Boundedness 
Theorem in its normal context, with its standard proof. 

(6.3.9) Theorem. Let (Ti)i^i be a family of bounded linear mappings 
from a Banach space X into a Banach space Y , such that {||Tix|| : i G 1} 
is bounded for each x G X. Then {||T)|| : i G /} is bounded. 

Proof. Our hypotheses ensure that for each x G X, 

Ux{i) = TiX {i G I) 

defines an element Ux of Clearly, the mapping x Ux of X into 

B{I, Y) is linear. We prove that its graph is closed in AT x B{I, Y). Indeed, 
if (xn) is a sequence converging to a limit Xoo in X, such that the sequence 
(ux„) converges to a limit / in B{I,Y), then for each i G I we have 

ll/(*) - Ux^{i)\\ < \\f{i) - Ux^{i)\\ + WuxA'i) - Ux^{i)\\ 

< \\f - Ux„\\ + \\Ti{x„ - x^)\\ 

< \\f-UxA + \\Ti\\ \\Xn-Xoo\\ 

— >■ 0 as n — >■ oo. 

Hence / = and so the linear mapping x Ux has a closed graph. By 
Proposition (4.5.1) and the Closed Graph Theorem (6.3.8), this mapping 
is bounded. Let 

c = supdlua;!! : xG X, ||x|| < 1} . 

Then for alH G / and all x in the unit ball of X, 

||7*a;|| = ||'Ua;(i)|| < Ilua;|| < c. □ 



(6.3.10) Exercises 

.1 Prove that if T is an open bounded linear mapping of a Banach space 
X into a normed space Y , then the range of T is complete. 

.2 Let X be a separable real Banach space with a Schauder basis (xn), 
and let S be the linear space consisting of all sequences (A„)))^;^ of 
real numbers such that the series ^nXn converges in X. Show 

that 

ll(A„)“ill = sup 

N>1 



N 



n—1 
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defines a norm on S. Prove that S' is a Banach space with respect to 
this norm. Then show that the mapping 

OO 

n—1 

is a bounded linear isomorphism of S onto X with a continuous 
inverse. Deduce that for each positive integer N the coordinate 
functional ^nXn ^ belongs to the dual space X* . 

.3 Use an argument like that of Exercise (6.3.3: 4) to give another proof 
of the Uniform Boundedness Theorem. 

.4 Let X, Y be Banach spaces, and suppose that for all distinct y, y' in 

Y there exists a bounded linear functional / on U such that f{y) yf 
f{y'). Let T : X ^ Y he & linear mapping such that if (x„) is a 
sequence in X converging to 0, then (/ o T) (x„) converges to 0 for 
each bounded linear functional / on Y . Prove that T is bounded. 
(Use the Closed Graph Theorem.) 

.5 Let (T„) be a sequence of bounded linear mappings of a Banach space 
X into a Banach space Y, such that the sequence (T„a;) converges in 

Y for each x G X. Prove that 



Tx = lim T„x 

n—^oo 

defines a bounded linear mapping T : X ^ Y. (Use the Uniform 
Boundedness Theorem.) 

.6 Let S,T be mappings of a Hilbert space H into itself such that 
(Sx,y) = (x,Ty) for all x,y G H. Prove that S and T are linear 
mappings. Then give two proofs that both S and T are bounded. 
(For one proof use the Closed Graph Theorem.) 

.7 Let A, B be disjoint subspaces of a Banach space X such that each 
element x of X can be written uniquely in the form 

X = PaX + Pbx 

with Pax G A and Pbx G B. Prove that the oblique projection map- 
pings Pa '■ X ^ a and Pb : X ^ B so defined are linear, and that 
they are bounded if and only if A and B are closed in X. 

.8 Prove Landau’s Theorem: if (a„) is a sequence of complex num- 
bers such that ^'^^lUnXn converges for each (cc„) G hiC), then 
(an) G hiC). (For each x = (x„) G hiC) and each k, define 
Sk(x) = X)n=i®n2;„. Apply the Uniform Boundedness Theorem to 
the sequence (sfc)^i of linear functionals on hiC), to show that the 
partial sums of bounded.) 
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.9 Let tu be a weight function on the compact interval / = [a,b\. For 
each positive integer n let {xn, 0 jXn,i, ■ ■ ■ ,Xn,n) be a partition of I, 
and define a linear functional on C(I) by 

n 

^nf ^ ^ 
k^O 

where each Cn,k ^ R- Prove Polyaks Theorem on approximate quadra- 
ture: in order that lim^^oo Ln f = /a fi^)w{x) dx for all / S C(/), it 
is necessary and sufficient that sup„>j^ ^ 

.10 Let T be an operator on a Hilbert space. Prove that ran(T) is closed 
if and only if ran(T*) is closed. (Suppose that ran(T) is closed, note 
Exercise (5.3.3: 6), and show that ran(T*T) is closed. To do so, let 
T*Txn — >■ and use the Uniform Boundedness Theorem to show 

that the linear functional Tx i— (a;,^) is bounded on the Hilbert 
space ran(r).) 

Perhaps the standard illustration of the Uniform Boundedness Theorem 
in action is the proof that there exists a 27r-periodic continuous function 
/:R^C whose Fourier series does not converge at 0. 

Let S denote the subspace of C°°(R, C) consisting of all 27r-periodic 
continuous mappings of R into C. Recall that the Fourier series, or Fourier 
expansion, of / G 5 at a; is defined to be 

OO 

s{f,x)= Yi 

n— — oo 

where the Fourier coefficients are given by 

fin) = T £ (nGZ). 

For each positive integer N let 

N 

s^{f,x)= Y 

n=-iV 



Then 

SN{f,x)=TJ f(^{^jjj^(^x-t)dt= T J f(-t)DN(t)dt, 

where the Dirichlet kernel Djsf is defined by 

N 

D^{t) = Y 

n=-AT 
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Define a linear mapping wat : 5— >■ R by 

unU) = Sat(/,0). 

Then 

\uN{f)\<^\\f\\ 

where ||/|| is the sup norm of /. Thus mat is bounded, and 

IImatII - ^ j (2) 

On the other hand, there exists a sequence (/„) of elements of S such that 

• — 1 < /n < 1 for each n, and 

• fn{t) sgn (Dff{t)) for each t G R; 

see Exercise (6.3.11: 2). Using Lebesgue’s Dominated Convergence Theorem 
(2.2.14), we now obtain 

1 /.TT I f'^ 

UN(fn) = ^ J fn{-t) Dn (t) dt ^ — J Sgn {D N {~t) ) D N (t) dt 



as n — >■ oo. But 



D.m = = K»(-o. 



Sin 



(I) 



so 



1 r'^ 

lim UN(fn) = 7T \DN{t)\ dt, 
n^oo Z7T 7 _ 7 t 

and therefore, in view of (2), 

^ /•TT 



\\un\\ = 



2tt 



\Dpf{t)\ dt. 



Next, noting that |sin (t/2)| < t/2 for all t > 0, we have 



\\un\\ = 



1 

27T 

1 

-.1 



' — TT 

>7T 



Dn (t)| dt 
sin {N + ^)t 



dt 



sin(l) 

2 1 

> — / - I sin (A^+ i)t| dt 

^ Jo t 



UW+i)7 



7T 



|sint| dt 
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2 - 1 /.«7T 

> — > — / |sini| dt 

^ N , 

= -F- 

n—1 

and therefore \\un\\ oo as N — >■ oo. By the Uniform Boundedness The- 
orem (6.3.9), there exists f £ S such that the set {|■u^r(/)| : iV > 1} is 
unbounded. Hence the Fourier series of / cannot converge at 0. 

(6.3.11) Exercises 

.1 Prove that 

sin (^N + 
sin (I) 

for each natural number N. 

.2 Prove that, in the notation of the preceding paragraphs, there exists 
a sequence (/„) of elements of S such that — 1 < /« < 1 for each n, 
and such that fn{t) — >■ sgn (DN{t)) for each t G R. 

.3 In view of the Riemann-Lebesgue Lemma (Exercise (2.3.3: 13)), 

defines a mapping T : Li[— 7r,7r] — >■ cq. Prove that T is one-one but 
not onto Cq. (Show that there exists a > 0 such that ||T(I?„)|| > 
a ||l?n|li for each n.) 



N 

Z = 

n^-N 




Appendix A 

What Is a Real Number? 



In this appendix we sketch Bishop’s adaptation of Cauchy’s construction 
of the set R, based on the idea that a real number is an object that can 
be approximated arbitrarily closely by rational numbers. 

Passing over the standard construction of the set Z of integers, we define 
a rational number to be an ordered pair (m, n) of integers, usually written 
m/n or such that n yf 0. Two rational numbers m/n and m! jn! are said 
to be equal, and we write m/n = m! /n! , if mn! and m!n are equal integers; 
this relation of equality is an equivalence relation. 

We should really define a rational number to be an equivalence class of 
ordered pairs of integers relative to the equivalence relation of equality that 
we have just introduced. In that case two rational numbers (equivalence 
classes) would be equal if and only if they were one and the same. However, 
it more closely reflects common practice if we follow the approach in which 
rational numbers are given by the integer pairs themselves, and equality of 
rational numbers is a defined notion (given by a certain condition on the 
integer pairs) rather than the logical one of identity.^ We follow a similar 
approach to the equality of real numbers in due course. In every case we 
use without further mention the standard symbol = to denote equality. 



^For example, from childhood we are led to consider the rational numbers 
I , I , and I as equal, not as representatives of some equivalence class. For another 
example, we consider the numbers 1,0- 999 • • • , and | to be equal although they 
are not logically identical (they are presented to us in different ways). 
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We omit the details of the familiar algebraic operations and the order 
relations >, > on the set Q of rational numbers. We identify the integer n 
with the rational number n/1. 

By a real number we mean a sequence x = of rational numbers 

that is regular in the sense that 

\xni-Xn\<^ + \ (m,nGN+). 

The term is called the nth rational approximation to the real number 
X. The set of real numbers is, of course, denoted by R. 

We identify a rational number r with the real number (r, r, r, ...) ; with 
that identification, Q, N, and Z become subsets of R. 

To specify completely the set R of real numbers, we must equip it with 
an appropriate notion of equality. Two real numbers x = (xn) and y = (yn) 
are said to be equal if 



\Xn -Vn\< ^ (n G N+). 

Note that this notion of equality is an equivalence relation: it is clearly 
reflexive and symmetric; its transitivity is a simple consequence of the 
following result. 

(A.l) Lemma. Two real numbers x = (x„) and y = (j/„) are equal if 
and only if for each positive integer k there exists a positive integer Nk such 
that \xn — yn\ < 1/fc whenever n > N^. 

Proof. If X = y, then for each k we need only take = 2k. Conversely, 
suppose that for each k there exists Nk with the stated property, and 
consider any positive integers n and k. Setting m = k + Nk, we have 

\Xn yn\ ^ \Xn | “t” \Xm I/m | “t” IVni I/n | 




Since this holds for all positive integers k, we see that \xn — yn\ < 2/n. But 
n is arbitrary, so x = y. □ 

(A. 2) Exercises 

.1 Complete the proof that equality of real numbers is an equivalence 
relation. 

.2 Let k be any positive integer. Show that the operation which assigns 
to each real number {x„)'^^i its kth. rational approximation Xk does 
not preserve equality. 
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.3 Prove that two real numbers x = (xn) and y = (j/„) are equal if and 
only if for each c > 0 and each positive integer k there exists Nk such 
that \xn — yn\ < c/k for all n > Ni^. 

To introduce the algebraic operations on R we need a special bound for 
the terms of a regular sequence x = (x„) of rational numbers. We define 
the canonical bound of x to be the least positive integer greater than 
|xi| + 2. It is easy to show that |x„| < for all n. 

The arithmetic operations on real numbers x = (cc„) and y = (?/„) are 
defined in terms of the rational approximations to those numbers as follows. 

+ y)n = + y 2 n, 

- y)n = ~ 2 / 2 ™, 

{xy)n = X 2 „„?/ 2 k«, where k = max{A:^, Ky}, 
max{a;, j/}„ = max{x„, y„}, 
min{x,y}n = min{a;„,y„}, 

\x\n = \Xn\ ■ 

Here, for example, (x + y)„ denotes the nth rational approximation to the 
real number x+y, and max{x„, y„} is the maximum, computed in the usual 
way, of the rational numbers Xn and ?/„. Of course, we must verify that the 
foregoing definitions yield real numbers; we illustrate this verification with 
the case of the product xy. Writing = x 2 K.ny 2 K.rn so that xy = (z„), for 
all positive integers m and n we have 

I Zm I — \^2Km{y2nm V2 K7l ) y2K.n{p^2 Km ^2/in)| 

\y2nm y2Kn\ |2/2/«n| |^2Km ^2Kn| 

- ^ + 2ik) + (diii + dbn) 




Thus xy is a regular sequence of rational numbers — that is, a real number. 

In the rest of this appendix, x = (x„), y = (?/„), and z = (z„) are real 
numbers. 

(A. 3) Exercises 

.1 Prove that \xn\ < Kx for each n. 

.2 Prove that x + y, x — y, maxja;,?/}, min{a;,?/}, and |cc| are real 
numbers. 

.3 Let x' and y' be real numbers such that x = x' and y = y' . Prove 
that X + y = x' + y' and xy = x'y' . 

Thus the operations of addition and multiplication arise from func- 
tions on the Cartesian product R x R when the relation of equality 
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on that set is defined in the natural way: (x,y) = (x',y') if and only 
if X = x' and y = y' . 

.4 Sums, differences, products, maxima, and minima of finitely many 
real numbers are defined inductively: for example, we define 

max{xi, . . . ,Xn+i} = max{max{xi, . . . , x„}, a;„+i}. 

Show that if O’ is a permutation of {1, ... , n}, then 

max{a;,^(i), . . .,Xa(n)} = max{xi, . . . ,x„}. 

.5 Prove each of the following identities. 

(i) x + y = y + x 

(ii) X + {y + z) = {x + y) + z 

(iii) xy = yx 

(iv) 0 + x = a; + 0 = a: 

(v) lx = xl = x. 

(These should serve to convince you that addition and multiplica- 
tion, as defined previously, have the properties that we expect from 
elementary school.) 

.6 Prove that for each m the mth rational approximation to 1/n — 
\x - Xn\ is 1/n - \x/^rn ~ Xn\ ■ 

The real number x = (x„) is said to be positive if there exists n such 
that Xn > 1/n. We define x > y to mean that x — y is positive; thus a; > 0 
if and only if x is positive. On the other hand, we say that x is 

• negative if —x is positive, and 

• nonnegative if > — 1/n for all n. 

We write x > y to denote that x — y is nonnegative, and we define x < y 
and x < y to have the usual meanings relative to the relations >, >. 

(A. 4) Exercise 

Prove that if a: > 0, then a; > 0. 

(A. 5) Proposition. A real number x = (xn) is positive if and only if 
there exists a positive integer N such that Xm > f/N for all m > N. On 
the other hand, x is nonnegative if and only if for each positive integer k 
there exists a positive integer Nk such that Xm > —1/k for all m > N^. 
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Proof. If X is positive, then > 1/n for some n. Choosing the positive 
integer N so that 2/N < a;„ — 1/n, for each m > N we have 



> Xn 

> Xn 

> Xn 
>^- 



\Xm Xn 

1 1 

m n 

I 1 

N n 



So the required property holds. If, conversely, that property holds, then 
xn+1 > 1/{N + 1), so X > 0. 

The proof of the second part of the proposition is left as an exercise. □ 



(A. 6) Exercises 

.1 Prove the second part of the preceding proposition. 

.2 Prove that if x = x', y = y' , and x > y (respectively, x > y), then 
x' > y' (respectively, x' > y'). 

.3 Prove the Axiom of Archimedes: if a; > 0 and y > 0, then there exists 
n G such that nx > y. 

.4 Prove that on Q the relations > and >, defined as for real num- 
bers, coincide with the standard elementary order relations between 
rational numbers. 

.5 Prove the triangle inequality for real numbers: |a; -I- j/| < |a:| -I- |y| . 



It is left as a relatively straightforward exercise to prove most of the 
elementary properties of the partial orders >, > on R. However, we need 
to tie up a few loose ends, the first of which concerns the order density of 
Q in R and requires a simple lemma. 



(A. 7) Lemma. \x — Xn\ < 1/n for each n. 

Proof. Fix the positive integer n. By Exercise (A. 3: 6), for each m the 
mth rational approximation to 1/n — \x — Xn\ is 

- — \xira — a;„| > - — (j^ + “) = ~ > — — . 

n I — n ''4m n' 4m m 

Hence 1/n — \x — Xn\ > 0, and therefore |a: — a;„| < 1/n. □ 



(A. 8) Proposition. Q is order dense in R — that is, for all x and y in 
R with X < y, there exists r G Q such that x < r < y. 



Proof. Since 



0 <y-x = {y 2 n - , 
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there exists N such that y 2 N — x^n > 1 /N. Writing 

r =\ {x2n + U2n) 
and using Lemma (A. 7), we have 



r - x>r - X 2 n - \x 2 N ~ a;| 

> \ {y2N — X2 n) ~ 2^ > 0: 

and similarly y — r > 0. Hence x < r < y. □ 

Here is a good application of Proposition (A. 8). 

(A. 9) Proposition. If x + y > 0, then either x > 0 or y > 0. 

Proof. Let x + y > 0. By Proposition (A. 8), there exists a rational 
number a such that 0 < a < x + y. Using Exercise (A. 6: 3), choose a 
positive integer n > 4/a. Let r = x„ and s = yn- Then r and s are 
rational; also, by Lemma (A. 7), |x — r| < a/4 and |y — s| < a/4. Using the 
triangle inequality, we now see that 

r + s>(x + y) — (|x — r| + |?/ — s|) 

>«-(! + !) 

a 

~ 2 • 

Since r and s are rational numbers, either r>a/4ors>a/4. In the first 
case, X > r — |x — r| > 0 ; in the second, y > 0. □ 

For each nonzero real number x the reciprocal^ or inverse^ of x is the real 
number ^ (also written 1/x or x~^) defined as follows. Choose a positive 
integer N such that |x„| > 1/N for all n> N, and set 

l/xj^3 if n < N 

1 jxnN^ if n> N. 

The last set of exercises in this appendix shows that this is a good definition 
of 1/x. 

(A. 10) Exercises 

.1 Let X be a nonzero real number, and 1/x the reciprocal of x as just 
defined. Prove that 1/x is a real number, and that it is the unique 
real number t such that xt = 1. 
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2 Let a; be a nonzero real number, and let N be as in the definition of 
1/x. Let M be a positive integer such that \xn\ > 1/M for all n > M, 
and define a real number y = (y„) by 

{ IjxM^ if n < M 

^/xnM'^ if n > M. 

Give two proofs that y = 1/x. 

3 Prove that the operation that assigns 1 /x to the nonzero real number 
X is a function (respects equality) and maps the set of nonzero real 
numbers onto itself. 




Appendix B 

Axioms of Choice and Zorn’s 
Lemma 



In the early years of this century it was recognised that the following princi- 
ple, the Axiom of Choice, was necessary for the proofs of several important 
theorems in mathematics. 

AC If if is a nonempty family of pairwise- disjoint nonempty sets, then 
there exists a set that intersects each member of T in exactly one 
element. 

In particular, Zermelo used this axiom explicitly in his proof that every set 
S can be well-ordered — that is, there is a total partial order > on S' with 
respect to which every nonempty subset of X has a least element [57]. It 
was shown by Godel [18] in 1939 that the Axiom of Choice is consistent 
with the axioms of Zermelo-Fraenkel set theory (ZF), in the sense that 
the axiom can be added to ZF without leading to a contradiction, and 
by Cohen [11] in 1963 that the negation of the Axiom of Choice is also 
consistent with ZF. Thus the Axiom of Choice is independent of ZF: it can 
be neither proved nor disproved without adding some extra principles to 
ZF. 

The Axiom of Choice is commonly used in an equivalent form (the one 
we used in the proof of Lemma (1.3.5)): 

AC' If A and B are nonempty sets, S C A x B, and for each x € A 
there exists y G B such that {x, y) G S, then there exists a function 
f : A ^ B — called a choice function for S — such that {x, f{x)) G S 
for each x G A. 
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To prove the equivalence of these two forms of the Axiom of Choice, first 
assume that the original version AC of the axiom holds, and consider 
nonempty sets A, B and a subset S' of A x such that for each x € A 
there exists y & B with {x, y) G S. For each cc G A let 

= {x} X {y e B : {x, y) G S} . 

Then T = is a nonempty family of pairwise-disjoint sets, so, by 

AC, there exists a set C that has exactly one element in common with each 
Fx ■ We now define the required choice function / : A — >■ by setting 

{x, f{x)) = the unique element of C fl 

for each a; G A. 

Now assume that the alternative form AC' of the Axiom of Choice holds, 
and consider a nonempty family T of pairwise-disjoint nonempty sets. 
Taking 



A = F, 

B= U X, 

S= {(X,x) : X G F, a; G X} 
in AC', we obtain a function 

/ : F ^ U X 

such that /(AT) G X for each X € F. The range of / is then a set that has 
exactly one element in common with each member of T . 

There are two other choice principles that are widely used in analysis. 
The first of these, the Principle of Countable Choice, is the case A = N of 
AC'. The second is the Principle of Dependent Choice: 

If a € A, S C A X A, and for each x € A there exists y & A 
such that {x,y) G S, then there exists a sequence in A 

such that a\ = a and (o„, o„+i) G S for each n. 

It is a good exercise to show that the Axiom of Choice entails the Principle 
of Dependent Choice, and that the Principle of Dependent Choice entails 
the Principle of Countable Choice. Since the last two principles can be 
derived as consequences of the axioms of ZF, they are definitely weaker 
than the Axiom of Choice. 

There are many principles that are equivalent to the Axiom of Choice. 
One of those, Zorn’s Lemma, is needed for our proof of the Hahn-Banach 
Theorem in Chapter 6. 

A nonempty subset C of a partially ordered set (A, is called a chain 
if for all x,y € C either x y or y x. Zorn’s Lemma states that 
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If every chain in a partially ordered set A has an upper bound 
in A, then A has a maximal element. 

For a fuller discussion of axioms of choice, Zorn’s Lemma, and related 
matters, see the article by Jech on pages 345-370 of [2]. 




Appendix C 

Pareto Optimality 



In this appendix we show how some of the results and ideas in our main 
chapters can be applied within theoretical economics. 

We assume that there are a finite number m of consumers and a finite 
number n of producers. Consumer i has a consumption set Xi C R^, where 
a consumption bundle Xi = (xjj , . . . , G Xi is interpreted as follows: 
Xi^. is the quantity of the fcth commodity (a good or a service) taken by 
consumer i when he chooses the consumption bundle Xi. Producer j has 
a production set Yj C R'^, where the /cth entry in the production vector 
yj = , • ■ • , 2/jjv ) G r,- is interpreted as the amount of the kth commodity 

produced by producer j under her adopted production schedule. Other 
important sets in this context are the aggregate consumption set 

X = Xi + • • • + Xm 
and the aggregate production set 

r = + . . . + 

A price vector is simply an element p of R^; the /cth component pk of 
p is the price of one unit of the fcth commodity. Thus the total cost to 
consumer i of the consumption bundle Xi is {p,Xi), where (•, •) denotes the 
usual inner product on R^ ; and the profit to producer j of the production 
vector pj is {p,Pj) ■ 

We assume that the preferences of consumer i are represented by a re- 
flexive, transitive total partial order on Xi, called the preference relation 
of consumer i. The corresponding relations )^i of strict preference, and 
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of preference-indifference , are defined on Xi as follows. 

X )^i y if and only if xipiy and not {y )pi x ) ; 

X y if and only if xipiy and y >pi x. 

Routine arguments show that )^i and are transitive; that x yi x is 
contradictory; that if either x yi y or x y, then x ipi y; and that if 
either x yi y z or x y yi z, then x yi z. The informal meaning of 
X y is that consumer i finds x at least as attractive as y; x yi y means 
that he strictly prefers x to y; and x y signifies that he does not mind 
which of a: or y he obtains. 

It is convenient to introduce consumer i’s upper contour set at x, 

[x, ->) = {f e X, : ^ x} , 
and his strict upper contour set at x, 

(x, -y) = {^ £ Xi : ^ yi x} . 

The preference relation yi is said to be locally nonsatiated at Xi € Xi if for 
each e > 0, B{xi,s) fl (x^,— >■) is nonempty — that is, there exists x' G Xi 
such that \\xi — x'|| < s and x' yi Xi. 

By a chosen point for consumer i under the price vector p we mean a 
point f,i G Xi such that for all Xi £ Xi, 



or, equivalently. 



'y i ^ ^ (P? ^z) ■ 



(C.l) Lemma. If £,i £ Xi is a chosen point for consumer i under the 
price vector p, and Xi ^ £,i is a point of Xi at which yi is locally nonsatiated, 
then {p, Xi) > {p, . 



Proof. Suppose that (p,Xi) < {p,fi). By the continuity of the mapping 
X !->■ {p, x) on R^, there exists r > 0 such that if x' G Xi and ||x' — Xi|| < r, 
then (p, x') < (p, fi) . As )pi is locally nonsatiated at Xj, there exists x' G Xi 
such that X- yi Xi and ||x' — Xi\\ < r. Then (p, > (p, x'f ) ; so ipi x', as 
fi is a chosen point. But we also have x' yi Xi ^i ^i and therefore x' yi f,i, 
a contradiction. □ 

We now assume that consumer i has an initial endowment of com- 
modities, represented by the vector Xi = {xi^, . . . ,Xij^). The total initial 
endowment of all consumers is then 



X = Xi H h Xm G X. 
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We say that an element (j/i, . . . , yn) of x • • • x is an admissible array 
of production vectors; and that an element (xi, . . . , Xm) of Xi x ■ ■ ■ x Xm is 
a feasible array of consumption bundles if there exists an admissible array 
{yi, . . . ,yn) of production vectors such that 

m n 

j=i 

Intuitively, a feasible array is one that can be obtained by a distribution of 
the total initial endowment and the total of the production vectors under 
some production schedule. 

An array (^i, . . . , £ Xi x ■ ■ ■ x Xm of consumption bundles is said to 

be Pareto optimal, or a Pareto optimum, if it is feasible and if the following 
condition holds. 

PO If {x\, . . . , Xm) is a feasible array such that Xi )^i for some i, then 
there exists k such that Xk- 

Equivalently, the array is Pareto optimal if there is no feasible array {x \, . . . , 
Xm) such that Xi )pi for all i, and such that Xi f,i for at least one i. 

By a competitive equilibrium we mean a triple consisting of a price vector 
p, an array (^i, . . . ,^m) of consumption bundles, and an admissible array 
{rji,. . . , rjn) of production vectors, satisfying the following conditions. 

CEl For 1 < i < TO, is a chosen point for consumer i under the price 
vector p. 

CE2 For 1 < j < n, if yj G Yj, then (p,r]j) > (p,yj). 

CE3 EZi^^ = E;=iVj+x- 

Condition CEl expresses consumer satisfaction; CE2, profit maximisation; 
and CE3, feasibility. 

(C.2) Proposition. Assume that each is locally nonsatiated, and let 



{P, (771,..., 7?„)) 



be a competitive equilibrium. Then (Ci, ■ • ■ ,£, 771 ) is a Pareto optimum. 

Proof. Condition CE3 ensures that is a feasible array of 

consumption bundles. Suppose that (^i, . . . , £m) is not a Pareto optimum. 
Then there exist an array (xi, . . . , Xm) of consumption bundles and an 
admissible array (j/i, . . . , ?/„) of production vectors such that 

m n 

^ X, = ^ + X, 

i=i j=i 



( 1 ) 
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Xi ii for all i, and Xk )^k Cfc for some k. By CEl, if Xi )^i ^i, then 
{p,x^) > (p,^i); in particular, (p,Xk) > (p,^k)- If >i x^, then Xi 
and so, by Lemma (C.l), (p,Xi) > (p,^i). Thus 



Hence 



i=l 

n 

= ’ll) + ^p^ 



i=i 



i=i 



^,[^Xi-^yj -x] ) > 0 , 
0=1 i=i 



and therefore, by the Cauchy-Schwarz inequality in , 

m n 

i=l i=i 



This contradicts (1). □ 

Our next aim is to establish a partial converse of Proposition (C.2), pro- 
viding conditions under which a Pareto optimum gives rise to a competitive 
equilibrium. We first introduce some more definitions. 

The preference relation Oi on Xi is said to be convex if 

• Xi is convex, 

• X "^i x' ^ tx + {1 — t)x' x' whenever 0 < t < 1, and 

• X ^i x' ^ tx + {1 — t)x' Oi x' whenever 0 < t < 1. 

In that case the sets [a;,— >■) and (x,— >■) are convex. 

We say that consumer i is nonsatiated at G Xi if there exists x € Xi 
such that X >~i ^i; otherwise, we say that he is satiated at 

(C.3) Proposition. Let (^i, . . . , be a Pareto optimum such that for 
at least one value of i, consumer i is nonsatiated at ^j, and let {pi , . . . , Pn) 
be an admissible array of production vectors. Suppose that )pi is convex for 
each i, and that the aggregate production set Y is convex. Then there exists 
a nonzero price vector p such that 

(i) for each i, if Xi € Xi and Xi >pi f,i, then {p,Xi) > {p,fi ) ; 
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(ii) for each j, ifyj G Yj, then (p,r]j) > (p,yj) . 

Proof. We may assume that consumer 1 is nonsatiated at Choose an 
admissible array (rji, . . . ,r]n) of production vectors such that 

m n 

i=i j=i 

Let A be the algebraic sum of the sets (Cij— >■) and 

A = G : a;i and Vi > 2 (xi ^i)| , 

and let 

B = ix £ :3y£Y (x = j/ + x)| . 

Clearly, B is convex; by our convexity hypotheses, A is convex, li AC\ B 
is nonempty, then there exist xi fi, Xi fi (2 < i < m), and yj G 
Yj (1 Y j < n), such that 

m 71 

^Xi = ^y^+ X. 

*=i i=i 

This contradicts the hypothesis that is a Pareto optimum. 

Hence A and B are disjoint subsets of R'^. Since these sets are clearly 
nonempty, it follows from Minkowski’s Separation Theorem (6.2.6) and the 
Riesz Representation Theorem (5.3.1) that there exist a nonzero vector 
p G R'^ and a real number a such that {p, x) > a for all x £ A, and 
(Pix) < a for all x £ B. Since f £ B, we have {p,f.) < a. We now show 
that (p, f) = a. 

To this end, consider a^ijwith x\ and Xi fi {2 < i < m). 

For 0 < t < 1 define 

Zi{t) = tXi + (1 - t)C* (1 < * < w) 



and 

m 

z(t) = ^Z,(t). 

i=l 

Since )p=i is convex for each i, 

^i(^) G (?i) “^) ) 

Zi{t) £ [^i,-l>) (2 < i < to) ; 

whence z{t) £ A and therefore (p,z{t)) > a. Letting t — >• 0 and using the 
continuity of the mapping x i— (p, x) on R^, we see that (p, C) > a and 
therefore that (p, ^) = a, as we wanted to show. 
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It now follows that (p, x) > {p, for all x £ A, and that {p, x) < {p, 
for all a; G B. Thus if is an admissible array of production 

vectors, then 

n \ j n 

+ x) < {p, 0 = + x 

i=i / \ 

and therefore 

n n 

XI (P’ Vi) ^ X ■ 

i=i t=i 

Given j £ n}, and taking yj £ Yj and pk = Pk for all fc yf j (1 < 

k < n), we now obtain (j>,Pj) > {p,yj). This completes the proof of (ii). 

A similar argument, using the fact that {p,x) > (p, for all x £ A, 
shows that 

{p,xi) > (p,^i) for all a;i G (2) 

and that (p,Xi) > (p,^i) for all Xi £ [^i,— >■) (2 < i < m). To complete the 
proof of (i), we show that if xi ^i, then (p, xi) > (p,^i). To this end, we 
recall that consumer 1 is nonsatiated at ^i, so there exists x\ £ Xi with 

x'l Cl ~i xi. 

It follows from this and the convexity of that for each t £ (0, 1), 
x{(t) = tx{ + (1 - t)xi Ci; 

whence (p, x^(t)) > (p, Ci), by (2). The continuity of the function x i-G- (p,x) 
on now ensures that (p, xi) > (p,^i), as we required. This completes 
the proof of (i) . □ 



(C.4) Corollary. Under the hypotheses of Proposition (C.3), suppose 
also that the following conditions hold. 

(i) For each price vector p and each z (1 < z < m), there exists G Xi 
such that (p, Ci) < (P)Ci) (cheaper point condition). 

(ii) For each z (1 < z < m), (Ci;“^) *s open in Xi. 

Then (p, (Ci, • . • , Cm), (zyi, • ■ • , Pn)) is a competitive equilibrium. 

Proof. In view of Proposition (C.3), we need only prove that CEl holds. 
To this end, let Xj Cz, and choose C( G Xi as in hypothesis (i). Then, by 
Proposition (C.3), Cz >~i C(. For each t £ (0, 1) define 



X^(t) = t^'i + (1 - f)Xi. 
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As — >■) is open in Xj, we can choose t G (0, 1) so small that Xi(t) >~i 

Then, by Proposition (C.3), 

t (P. Q + (1 - ^) {P, Xi) = {p, Xi{t)) 

= t{p,^^) + (1 - 1) (p,6) 



Hence 

(1 - t) {p,Xi) > (1 - t) 

and therefore (p,Xi) > (p,^i) ■ Thus is a chosen point. □ 

The cheaper point assumption cannot be omitted from the hypotheses 
of Corollary (C.4); see pages 198-201 of [51]. 
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Kernel, 186 

Korovkin’s Theorem, 212, 215 
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Landau’s Theorem, 287 
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limit of a sequence, 20, 139 

limit point, 48, 138 

limit superior, 24 
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Mean Value Theorem, 57 
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negative, 12, 294 
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orthogonal family, 242 
orthonormal, 242 
orthonormal basis, 247 
oscillation, 45 
outer measure, 79 
outer measure, hnite, 80 

P-adic metric, 127 
p-power summable, 196 
parallelogram law, 236 
Pareto optimum, 305 
Parseval’s identity, 248 
partial order, 6 
partial sum, 27, 180 
partially ordered set, 6 
partition, 62 
path, 163 

path component, 164 
path connected, 163 
Peano’s Theorem, 228 
period, 215 
periodic, 215 
Picard’s Theorem, 223 
points at infinity, 129 
pointwise, 4 

polarisation identity, 255 
Polya’s Theorem, 288 
positive, 12, 294 
positive integers, 3 
positive linear operator, 212 
positively homogeneous, 261 
power series, 31 
precompact, 149 
preference relation, 303 
preference relation, convex, 306 
preference-indifference, 304 
prehilbert space, 234 
preorder, 6 
price vector, 303 
primitive, 69 
producer, 303 
product metric, 165 
product norm, 176 
product normed space, 176 
product of paths, 164 
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production set, 303 
production vector, 303 
projection, 166, 240 
pseudometric, 127 
Pythagoras’s Theorem, 238 

Quotient norm, 181 
quotient space, 181 

Radius, 130 

radius of convergence, 31 
ratio test, 29 

rational approximation, 292 
rational complex number, 193 
rational number, 291 
rational numbers, 3 
real line, extended, 129 
real number, 12, 292 
real number line, 11 
real part, 19 
rearrangement, 34 
reciprocal, 296 
refinement, 62 
reflexive, 253, 266 
reflexive, 5 
regular, 292 

remainder term, Cauchy form, 59 
remainder term, Lagrange form, 59 
representable, 272 
representation, 187 
Riemann integrable, 64 
Riemann integral, 64 
Riemann sum, 67 
Riemann-Lebesgue Lemma, 112 
Riemann-Stieljtes integrable, 72 
Riemann-Stieltjes integral, 72 
Riemann-Stieltjes sum, 72 
Riesz Representation Theorem, 252 
Riesz’s Lemma, 190 
Riesz-Fischer Theorem, 198 
right hand derivative, 280 
Rodrigues’ formula, 252 
Rohe’s Theorem, 56 
root test, 30 

Satiated, 306 
Schauder basis, 269 
second category, 280 
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second dual, 253 
self-map, 149, 220 
selfadjoint, 254 
seminorm, 261 
separable, 132 
separates, 278 
sequence, 4 

sequentially compact, 149 
sequentially continuous, 45, 140 
series, 27, 180 
simple function, 116 
smallest element, 8 
step function, 99 
Stone- Weierstrass Theorem, 216, 
219 

strict partial order, 6 

strict preference, 303 

strictly decreasing, 8 

strictly increasing, 8 

subadditive, 261 

subcover, 47, 146 

subfamily, 5 

sublinear, 261 

subsequence, 4 

subspace, 131, 176 

subspace of a prehilbert space, 234 

sufficiently large, 20 

sum, 180 

sup norm, 175, 204 
supremum, 7 
supremum norm, 175 
supremum of a function, 8 
symmetric, 5 

Taxicab metric, 126 
Taylor expansion, 61 
Taylor polynomial, 58 
Taylor series, 61 
Taylor’s Theorem, 58 
term, 4, 27 
termwise, 5 

Tietze Extension Theorem, 144 
topological space, 134 
topology, 135 
total, 193 
total order, 6 
totally bounded, 149 
totally disconnected, 160 
transitive, 6 



translation invariant, 81, 99 
transported, 128 
triangle inequality, 15, 126, 174 
triple recursion formula, 251 

Ultrametric, 127 
unconditionally convergent, 180 
uncountable, 4 

Uniform Boundedness Theorem, 
186, 286 

Uniform Continuity Theorem, 49, 
154 

uniformly approximated, 212 
uniformly continuous, 49, 142 
uniformly convex, 186 
uniformly equicontinuous, 209 
unit ball, 174 
unit vector, 174 
upper bound, 7 
upper contour set, 304 
upper integral, 63, 73 
upper limit, 24 
upper sum, 63, 73 
Urysohn’s Lemma, 146 

Variation, 71 

Vital! covering, 82 

Vital! Covering Theorem, 82 

Weak solution, 257 
Weierstrass Approximation 
Theorem, 212 
Weierstrass’s M-test, 46 
weight function, 235 

Zermelo, 299 
Zorn’s Lemma, 300 
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